ricardoglopez
|
Posted: March 29, 2011 23:18 by ricardoglopez
|
| Hi, I'm running tests with OpenEMPI. I imported the test file dataset_A_1000 three times by mistake, so when I search for a patient, the same patient comes up three times. My question is: shouldn't the system have told me somehow that I was adding the same patient several times, so that uniqueness is enforced? Thanks. |
duplicate patient registration entries
Replies: 2 - Last Post: April 13, 2011 21:55
by: Csaba Toth
Csaba Toth
|
Posted: April 09, 2011 14:54 by Csaba Toth
|
| Sorry ricardoglopez, but I cannot interpret your question. Could you rephrase it, possibly with better English? |
Csaba Toth
|
Posted: April 13, 2011 21:55 by Csaba Toth
|
|
One side note: you don't need to import the example FEBRL datasets multiple times. They are crafted so that they already contain multiple mangled entries of the same records. Each dataset comes with a description of how many duplicated entries it has and what kind of corruption algorithm altered them; the A, B, ... F datasets differ in that respect. Check it out. After importing, you can perform a deduplication by running a match. Before matching, you have to configure blocking and either the deterministic/exact matching parameters or the probabilistic/Fellegi-Sunter matching parameters, depending on what you want to do. |
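To make the blocking and matching terms a bit more concrete, here is a rough Python sketch of the general idea. It is not OpenEMPI code and does not use its API or configuration format: the record fields, the choice of surname as the blocking field, and the m/u probabilities are all made-up assumptions for illustration only.

```python
import math
from collections import defaultdict
from itertools import combinations

# Hypothetical records; the field names are illustrative, not OpenEMPI's schema.
records = [
    {"id": 1, "given_name": "john", "surname": "smith", "dob": "19800101"},
    {"id": 2, "given_name": "jon", "surname": "smith", "dob": "19800101"},
    {"id": 3, "given_name": "mary", "surname": "jones", "dob": "19751212"},
    {"id": 4, "given_name": "john", "surname": "smith", "dob": "19800101"},
]


def candidate_pairs(recs, blocking_field):
    """Blocking: only records sharing the blocking field value are compared."""
    blocks = defaultdict(list)
    for rec in recs:
        blocks[rec[blocking_field]].append(rec)
    for block in blocks.values():
        yield from combinations(block, 2)


def exact_match(a, b, fields):
    """Deterministic matching: every listed field must be identical."""
    return all(a[f] == b[f] for f in fields)


def fellegi_sunter_score(a, b, m, u):
    """Probabilistic matching: sum of log2 agreement/disagreement weights.

    m[f] = assumed P(field agrees | same person)
    u[f] = assumed P(field agrees | different persons)
    """
    score = 0.0
    for field in m:
        if a[field] == b[field]:
            score += math.log2(m[field] / u[field])
        else:
            score += math.log2((1 - m[field]) / (1 - u[field]))
    return score


# Assumed m/u probabilities; in practice these are estimated or configured.
m_probs = {"given_name": 0.9, "dob": 0.95}
u_probs = {"given_name": 0.1, "dob": 0.05}

pairs = list(candidate_pairs(records, "surname"))
exact = [
    (a["id"], b["id"])
    for a, b in pairs
    if exact_match(a, b, ["given_name", "surname", "dob"])
]
scores = {
    (a["id"], b["id"]): fellegi_sunter_score(a, b, m_probs, u_probs)
    for a, b in pairs
}

print("candidate pairs:", [(a["id"], b["id"]) for a, b in pairs])
print("exact matches:", exact)
for pair, score in scores.items():
    print(pair, round(score, 2))
```

In this toy data, blocking on surname means the "jones" record is never compared with the "smith" records. Exact matching only links the two records that agree on every field, while the probabilistic score still gives the "john"/"jon" pair a positive weight because the date of birth agrees. That is roughly why the probabilistic setup tends to suit the mangled FEBRL entries better than exact matching.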







