duplicate patient registration entries

3 posts
Replies: 2 - Last Post: April 13, 2011 21:55
by: Csaba Toth
 
Posted: March 29, 2011 23:18 by ricardoglopez
Hi, I'm running tests with OpenEMPI. I imported the test file dataset_A_1000 three times by mistake, so when I search for a patient, it shows up three times. My question is: shouldn't I have been told that I was adding the same patient several times, so that uniqueness is enforced? Thanks.
 
Posted: April 09, 2011 14:54 by Csaba Toth
Sorry ricardoglopez, but I cannot interpret your question. Could you rephrase it, possibly with better English?
 
Posted: April 13, 2011 21:55 by Csaba Toth
One side note: you don't need to import the example FEBRL datasets multiple times. They are crafted so that they already contain multiple mangled entries of the same records. There's a description of how many duplicate entries each dataset has and what kind of corruption algorithm altered them; the A, B, ... F type datasets differ in exactly that. Check it out.
After importing, you can perform deduplication by running a match. Before matching, you have to configure blocking and either the deterministic/exact matching parameters or the probabilistic/Fellegi-Sunter matching parameters, depending on what you want to do.
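To illustrate the idea only, here is a minimal Python sketch of what blocking plus matching does conceptually. It is not OpenEMPI code or configuration: the record fields, the blocking key, and the m/u probabilities are all made-up assumptions for the example.

```python
import math
from collections import defaultdict
from itertools import combinations

# Hypothetical toy records; field names are illustrative, not OpenEMPI's schema.
records = [
    {"id": 1, "given": "john", "family": "smith", "dob": "19700101", "postcode": "2000"},
    {"id": 2, "given": "jon",  "family": "smith", "dob": "19700101", "postcode": "2000"},
    {"id": 3, "given": "john", "family": "smith", "dob": "19700101", "postcode": "2000"},
    {"id": 4, "given": "mary", "family": "jones", "dob": "19820315", "postcode": "3000"},
]

def block(recs, key_fields):
    """Group records by a blocking key so only records in the same block are compared."""
    blocks = defaultdict(list)
    for r in recs:
        blocks[tuple(r[f] for f in key_fields)].append(r)
    return blocks

def exact_match(a, b, fields):
    """Deterministic rule: every listed field must be identical."""
    return all(a[f] == b[f] for f in fields)

def find_duplicates(recs, block_fields, match_fields):
    """Return id pairs that satisfy the exact-match rule within each block."""
    pairs = []
    for group in block(recs, block_fields).values():
        for a, b in combinations(group, 2):
            if exact_match(a, b, match_fields):
                pairs.append((a["id"], b["id"]))
    return pairs

# Assumed (m, u) probabilities per field, chosen by hand for this example:
# m = P(field agrees | same person), u = P(field agrees | different people).
M_U = {"given": (0.9, 0.05), "family": (0.95, 0.01), "dob": (0.98, 0.001)}

def fellegi_sunter_weight(a, b, m_u):
    """Sum of log2 agreement/disagreement weights over the compared fields."""
    weight = 0.0
    for field, (m, u) in m_u.items():
        if a[field] == b[field]:
            weight += math.log2(m / u)
        else:
            weight += math.log2((1 - m) / (1 - u))
    return weight

duplicate_pairs = find_duplicates(
    records,
    block_fields=["family", "postcode"],
    match_fields=["given", "family", "dob"],
)
print(duplicate_pairs)
print(fellegi_sunter_weight(records[0], records[1], M_U))
```

With the exact rule, only records 1 and 3 are paired, because record 2 has a mangled given name. The probabilistic weight for records 1 and 2 still comes out positive, which is why a Fellegi-Sunter style match can catch corrupted duplicates that an exact match misses. Where you put the match threshold is up to you.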