
Bits of Word files missing

Posted: Fri Oct 18, 2013 9:46 am
by BillSparrow
When I compare two Word 2000 documents and one of them contains an elongated hyphen, all the text to the left of the hyphen is omitted from that line.
I have traced the bug to the plug-in catdoc.exe.
Running catdoc.exe from the command line on this Word document gives an output which omits all characters on the offending line up to and including the hyphen.
When I open the Word document in a hex viewer, the hyphen appears to be hex 96 (the en dash in the Windows-1252 code page), not the usual 2D.
I don't know how the non-standard hyphen got into the document, though.
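For anyone who wants to check their own files for the same character, here is a minimal Python sketch of the hex-viewer check. It assumes the document text is stored as 8-bit Windows-1252, where 0x96 decodes to the en dash (U+2013) and 0x97 to the em dash (U+2014). The function name find_suspect_bytes and the sample bytes are made up for illustration, and a real .doc file also holds binary structures, so a hit outside the text area may be a false positive.

```python
# Sketch only: scans raw bytes for Windows-1252 dash characters.
# Assumption: the text is stored as 8-bit cp1252, not UTF-16.
SUSPECT_BYTES = {0x96: "EN DASH", 0x97: "EM DASH"}


def find_suspect_bytes(data: bytes):
    """Return (offset, byte value, decoded character) for each suspect byte."""
    hits = []
    for offset, value in enumerate(data):
        if value in SUSPECT_BYTES:
            # Decode the single byte to see which character it stands for.
            hits.append((offset, value, bytes([value]).decode("cp1252")))
    return hits


# Hypothetical sample line; with a real file, read it with open(path, "rb").
sample = b"left text \x96 right text"
for offset, value, char in find_suspect_bytes(sample):
    print(f"offset {offset}: 0x{value:02X} -> U+{ord(char):04X} {SUSPECT_BYTES[value]}")
```

An ordinary hyphen (hex 2D) is not reported, so any hit points at a character that Word may have substituted for a typed hyphen.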

Re: Bits of Word files missing

Posted: Fri Oct 18, 2013 10:07 am
by psguru
We have no control over the catdoc implementation, and it has its limitations. It's quite a popular tool, so you may be able to find some information about this on the web.