colombian Posted May 20, 2008 Share Posted May 20, 2008 How could I got about creating a script for deduping a mailing list? (Not email - just Name and address) I know this is broad, I just need some pointers and Google hasn't been helpful. The files can be obtained in CSV, Excel, among others (they can be converted from one to the other easily - so whichever format is easier to dedupe) Quote Link to comment Share on other sites More sharing options...
BlueSkyIS Posted May 20, 2008 Share Posted May 20, 2008 load your data into a mysql database, table some_table, then: SELECT DISTINCT (name, address) FROM some_table Quote Link to comment Share on other sites More sharing options...
colombian Posted May 20, 2008 Author Share Posted May 20, 2008 Thanks Blue, I believe SQL might not be enough - but hopefully you can shed some light on this: What if the names / addresses are slightly different? Like: John R. Smith, 600 pacificic ave, CA 98342 and John Rogers Smith, 600 SW Pacific AVE, CA 98342 Any ideas? Thanks again. Quote Link to comment Share on other sites More sharing options...
Psycho Posted May 20, 2008 Share Posted May 20, 2008 I don't know that there is going to be an easy solution. There are a ton of variables to be considered when searching for duplicates. There are companies that sell software for a lot of money to do just that so gettng their criteria will not be possible. So, your best bet is to look for a pre-built solution if someone has been kind enough to release something publicly. However, the type of license they released the code under may prevent you from using it for free. You'll just have to check the restritions if you find something. If you can't find something, and assuming you don't want to buy something. your only option is to build a solution or pay someone to build it. If it doesn't have to be PHP, here is a list of applicatins that might work. They range in price from $20 to over $600. The level of effectiveness will most likley be directly proportional to the price. Some may be able to be incorporated in PHP if built to do so. Quote Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.