ExamDiff don't see extra lines correctly

General questions about using ExamDiff Pro, ideas for new features, bug reports, and usage tips.
Post Reply
WolfJe63
New Member
Posts: 1
Joined: Sun Jun 27, 2004 3:59 am

ExamDiff don't see extra lines correctly

Post by WolfJe63 »

Hi,

Sometimes ExamDiff does not see newly inserted lines in two files correclty.

I have some CSV files that have a large number of lines (about 10.000 items).
The first 18 digits contain the item number (ie 000000000000110535).

When comparing two files, there are item numbers inserted an ExamDiff just cannot find the difference.

ie:

File 1:
000000000000110535
000000000000110536
000000000000110539
000000000000110540

File 2:
000000000000110535
000000000000110536
000000000000110537
000000000000110538
000000000000110539
000000000000110540

Result:
000000000000110535 000000000000110535
000000000000110536 000000000000110536
000000000000110539 000000000000110537 <Examdiff says: changed line
000000000000110540 000000000000110538 <Examdiff says: changed line
__________________ 000000000000110539 <Examdiff says: added line
__________________ 000000000000110540 <Examdiff says: added line

What I Expected:
000000000000110535 000000000000110535
000000000000110536 000000000000110536
__________________ 000000000000110537 <Examdiff says: added line
__________________ 000000000000110538 <Examdiff says: added line
000000000000110539 000000000000110539
000000000000110540 000000000000110540


Offcourse the lines contains a lot more information (the files are about 5Mb each). I have the files and /or screenprints available. Please send me a Private Message to get them.

Maybe there is a setting to see make ExamDIff behave correctly. Want I want to do is:
- See which items are changed (added/removed)
- See which characters per line have changed.

Some lines are handled correctly in the file though.

Thanks,
Jan Paul
The Netherlands
User avatar
psguru
Site Admin
Posts: 2231
Joined: Sat May 15, 2004 4:23 pm
Location: California
Contact:

Post by psguru »

ExamDiff Pro can only match IDENTICAL lines. EDP does not know that it should compare similar lines -- they are similar but not identical. Everything is black and white: either lines are the same, and hence they have to be matched (aligned), or the lines are different, and they don't have to be matched. There's also an explanation of this phenomenon at
http://www.prestosoft.com/ps.asp?page=edp_faq#20

If you want some lines to match, you need to make them identical using different "Ignore" options, such as regexes.


If your files just had

File 1:
000000000000110535
000000000000110536
000000000000110539
000000000000110540

File 2:
000000000000110535
000000000000110536
000000000000110537
000000000000110538
000000000000110539
000000000000110540

EDP would've matched lines properly.
psguru
PrestoSoft
Post Reply