When I compare two Word 2000 documents and one of them contains an elongated hyphen, all the text to the left of the hyphen is omitted from that line.
I have traced the bug to the plug-in catdoc.exe.
Running catdoc.exe from the command line on this Word document gives an output which omits all characters on the offending line up to and including the hyphen.
When I open the Word document in a hex file viewer, the hyphen appears to be hex 96, not the usual 2D.
I don't know how the non-standard hyphen got into the document, though.
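For anyone who wants to confirm the same thing without a hex viewer, here is a minimal Python sketch. It assumes the document stores its text as 8-bit Windows-1252, where byte 0x96 is the en dash (U+2013) and 0x2D is the ordinary hyphen. The function names and the file name are made up for illustration. A .doc file is a binary container, so some hits will probably fall outside the actual text and can be ignored.

```python
from pathlib import Path

# Assumption: 8-bit Windows-1252 text, where 0x96 is the en dash.
# The ordinary ASCII hyphen-minus is 0x2D.
EN_DASH_CP1252 = 0x96


def find_byte_offsets(data: bytes, value: int = EN_DASH_CP1252) -> list[int]:
    """Return every offset at which the given byte value occurs."""
    return [i for i, b in enumerate(data) if b == value]


def report(path: str, context: int = 20) -> None:
    """Print each hit with a little surrounding text decoded as cp1252."""
    data = Path(path).read_bytes()
    for offset in find_byte_offsets(data):
        start = max(0, offset - context)
        snippet = data[start:offset + context]
        # errors="replace" copes with the few byte values cp1252 leaves undefined.
        print(f"0x{offset:08X}: {snippet.decode('cp1252', errors='replace')!r}")


if __name__ == "__main__":
    report("example.doc")  # hypothetical file name
```

Hits whose surrounding text is readable are the ones worth checking against the lines that come out truncated.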
Bits of Word files missing
Re: Bits of Word files missing
We have no control over the catdoc implementation, and it has its limitations. It is quite a popular tool, so you may be able to find some information about this on the web.
psguru
PrestoSoft