-
1. Re: Duplicated matches in MTCH table
Pandiarajan B Oct 23, 2020 3:09 AM (in response to Jesus Perez)Hi,
There is no check which would actually check in the MTCH table to identify if there is an entry created for two corresponding rowid values from the previous executions. So there is a possibility for duplicate requests present as it is of Manual match rules, Expectation for the Manual matches is that data steward reviews the record and then approve the right one. In some cases there might be a confusion on picking the right one among the multiple entries . That is the reason we usually recommend to either merge the records/remove the entries from the MTCH table before running the next match job.
Hope it helps!
Regards,
Rajan -
2. Re: Duplicated matches in MTCH table
Pandiarajan B Oct 23, 2020 3:09 AM (in response to Pandiarajan B)Also please note, Duplicate entries will be removed once the right records are merged.
Regards,
Rajan -
3. Re: Duplicated matches in MTCH table
kunal pandit Oct 27, 2020 12:10 AM (in response to Jesus Perez)Can you please confirm if you used a single cleanse/process server for this match job or multiple process servers.
-
4. Re: Duplicated matches in MTCH table
Jesus Perez Oct 27, 2020 12:22 AM (in response to kunal pandit)Single process server
-
5. Re: Duplicated matches in MTCH table
Piyush Paras Oct 29, 2020 12:03 PM (in response to Jesus Perez)Are these records matching with the same match rule?
-
6. Re: Duplicated matches in MTCH table
Dhananjay Singh Oct 29, 2020 3:07 PM (in response to Jesus Perez)if you see create_date is different for 2 records, can you please and confirm if you ran Match job twice?
Also as mentioned above we usually recommend to either merge the records/remove the entries from the MTCH table before running the next match job
-
7. Re: Duplicated matches in MTCH table
Sathiesh M Nov 1, 2020 1:10 AM (in response to Jesus Perez)Based on data, i believe originally you must have a different value for ROWID_OBJECT for the 1st record. That must have merged to 12062082, so this ID is updated on the pending manual match records. Thats why you see a UPDATED_BY admin (probably automerge job run by admin)
There's a KB soon to be released on this. For now sharing the content of KB below.
TitleHOW TO: Avoid invalid manual merge tasks in MDM
SolutionIn MDM we can define Auto Match rule and Manual match rule.
At the end of the match job, Build match Group (BMG) process runs on Match pairs that are identified by Auto rule. This makes sure there are no redundant matches and transitive matches are re-arranged to have common ROWID_MATCHED value.
But this process is not applicable for Manual rule, and due to that we may get match pairs like A->B, B->A and B->C. The task daemon created individual tasks for these matches. And say when Data steward works on a Merge task, the other 2 tasks go invalid. Ideally all of the 3 match pairs should have been part of One Task ID, so data stewards can make a decision with all candidates in single screen.
Currently an Enhancement request MDM-21416 in place, which may be triaged for future releases..
Workaround:A quick workaround is creating a new MatchRuleSet specifically for manual/loose rules, but set it up as Auto rule. Run only Match job and not the merge job for this rule, then immediately flip the AUTOMERG_IND to 0 in the _MTCH table for this particular RuleID.
This way made sure the BMG is performed on these records and the Task daemon now assigns single TaskID to all the transitive matches
-
8. Re: Duplicated matches in MTCH table
Jesus Perez Nov 1, 2020 11:08 PM (in response to Piyush Paras)Yes, they match with the same rule
-
9. Re: Duplicated matches in MTCH table
Jesus Perez Nov 2, 2020 12:34 AM (in response to Sathiesh M)Hello Sathiesh. It seems that what you say is exactly what happens. Now the populated updated by and lud columns make sense.
Thanks
-
10. Re: Duplicated matches in MTCH table
Rahul Tiwary Nov 15, 2020 10:42 AM (in response to Jesus Perez)Just to add on the BMG process on Manual rules, In case you run the BMG process for loose rules it will potentially generate huge clusters of unrelated matched records. This will create a lot of work for the data steward to manually go through this clusters and resolve it. This is the reason why BMG designed to not be executed for manual rules.