kickstart Posted April 16, 2009 Share Posted April 16, 2009 Hi Trying to get my head round this problem. Suspect the solution is something simple that I have missed. I have a table with several fields which I want to search for a particular term, basically name, address and email address. I want a limit of 10 rows returned, I want the matches on name first, then address and then email address (if the total is still under 10). And I don't want any duplicates. SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 'A' as MatchType FROM loanee_table WHERE Active = 1 AND upper(Name) LIKE '$searchFieldName' UNION SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 'B' as MatchType FROM loanee_table WHERE Active = 1 AND upper(Address) LIKE '$searchFieldName' UNION SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 'C' as MatchType FROM loanee_table WHERE Active = 1 AND upper(EmailAddress) LIKE '$searchFieldName' ORDER BY MatchType, Name Trouble with this is that the UNION brings back rows that are duplicated due to the MatchType field (which I want as I want the matches on name first, then the matches on address and then the matches on email address). Only way I can see round this is to specifically exclude prior matches on the subsequent selects like this :- SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 'A' as MatchType FROM loanee_table WHERE Active = 1 AND upper(Name) LIKE '$searchFieldName' UNION SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 'B' as MatchType FROM loanee_table WHERE Active = 1 AND upper(Address) LIKE '$searchFieldName' AND LoaneeId NOT IN (SELECT LoaneeId FROM loanee_table WHERE Active = 1 AND upper(Name) LIKE '$searchFieldName' ) UNION SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 'C' as MatchType FROM loanee_table WHERE Active = 1 AND upper(EmailAddress) LIKE '$searchFieldName' AND LoaneeId NOT IN (SELECT LoaneeId FROM loanee_table WHERE Active = 1 AND upper(Name) LIKE '$searchFieldName' UNION SELECT LoaneeId FROM loanee_table WHERE Active = 1 AND upper(Address) LIKE '$searchFieldName' ) ORDER BY MatchType, Name which looks likely to be hideously inefficient to me. Any suggestions for an efficient way of doing this? All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/ Share on other sites More sharing options...
jackpf Posted April 16, 2009 Share Posted April 16, 2009 You could put everything into an array, and then use array_unique() to get rid of duplicates. That's what I do... Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-811470 Share on other sites More sharing options...
kickstart Posted April 16, 2009 Author Share Posted April 16, 2009 Hi Possible, but think I would have issues as the array items are not unique (due to the Matchtype), as I would have to pass an array of arrays. I could get rid of the matchtype so array_unique would work, but if I did that I could remove it from the SELECT statements and the UNION would take care of the duplicates. However doing that would make the order incorrect (as I want matches on the first field, then matches on the 2nd field, then matches on the 3rd field). Cheers for the suggestion. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-811504 Share on other sites More sharing options...
jackpf Posted April 16, 2009 Share Posted April 16, 2009 Yeah... Hmm...that's all I've got tbh. I shall wait until someone more knowledgeable comes along... Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-811617 Share on other sites More sharing options...
fenway Posted April 16, 2009 Share Posted April 16, 2009 Try UNION DISTINCT (but then you'll have to drop "match type") Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-811760 Share on other sites More sharing options...
kickstart Posted April 16, 2009 Author Share Posted April 16, 2009 Hi Cheers. However UNION DISTINCT is the default for UNION, and the match type is required to get the order right so cannot really be dropped. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-811818 Share on other sites More sharing options...
fenway Posted April 17, 2009 Share Posted April 17, 2009 ....and the match type is required to get the order.... I disagree... the UNION will return the rows in the "order" the select statements are provided; and you can sort each one of the select statements by "name". Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812284 Share on other sites More sharing options...
kickstart Posted April 17, 2009 Author Share Posted April 17, 2009 I disagree... the UNION will return the rows in the "order" the select statements are provided; While it may work, the MySQL online pages tell you to use an extra field to force this order, which suggests that this is not certain (and it is heavily recommended against relying on any default order in SQL in general). To cause rows in a UNION result to consist of the sets of rows retrieved by each SELECT one after the other, select an additional column in each SELECT to use as a sort column and add an ORDER BY following the last SELECT:. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812348 Share on other sites More sharing options...
fenway Posted April 17, 2009 Share Posted April 17, 2009 I didn't say not to use an order by clause... i mean "ORDER BY name" in each one. Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812524 Share on other sites More sharing options...
kickstart Posted April 17, 2009 Author Share Posted April 17, 2009 Hi Yes, but that relies on each SELECT in the UNION coming back in order, and the MySQL documentation suggests that to get this to happen you need to add a field to each SELECT and sort on that extra column (which is what I have done and which prevents the elimination of duplicates). While it might work it seems far from certain it working is a design consideration in MySQL. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812566 Share on other sites More sharing options...
fenway Posted April 17, 2009 Share Posted April 17, 2009 While it might work it seems far from certain it working is a design consideration in MySQL. I assume you're referring to this quote from the manual? However, use of ORDER BY for individual SELECT statements implies nothing about the order in which the rows appear in the final result because UNION by default produces an unordered set of rows I've never actually had this be a problem... the only "workaround" is to use a GROUP BY and "filter" the rows, taking advantage of mysql's loose group by restrictions. But then you might get a random A/B/C... do you care? Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812584 Share on other sites More sharing options...
kickstart Posted April 17, 2009 Author Share Posted April 17, 2009 Hi Unfortunatly with this I do care. It is to provide a drop down list of people for an Ajax call for the user to select when they type, hence I don't want Mr Smith with the email address [email protected], and I want the matches on name to appear before matches on other fields. There are various ways round it (excluding previous matches but using a NOT IN on subselects that are the same as the previous selects, or manually processing it in php), but all appear to me to be inefficient fudges. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812601 Share on other sites More sharing options...
fenway Posted April 17, 2009 Share Posted April 17, 2009 Say that again? How can they be different if they're duplicates?!?! Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812630 Share on other sites More sharing options...
kickstart Posted April 17, 2009 Author Share Posted April 17, 2009 Hi In my original post you can see I have 3 SELECT statements unioned. I sort the results of this to make sure that the results or the first SELECT come 1st, second SELECT 2nd, etc, and have a constant on each set of results so this sort works (basically what the MySQL pages advise). Imagine searching for "smith" and there is a row for "Mr Smith", of "Smith Street" with an email address of "[email protected]". Each one of the SELECTs would bring back that same row. With a normal set of UNIONs they would be treated as duplicates and thus would only appear once. However because I have added the constant to use for the sort to ensure the results come back in the right order the rows are no longer exactly the same so are not eliminated as duplicates. While I can see ways round it none of them are elegant. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812637 Share on other sites More sharing options...
fenway Posted April 17, 2009 Share Posted April 17, 2009 I only now realize that you're UNIONing the *same* table! This is the wrong approach --- what you want is weighted matching. Then you don't need any magical union. Simply assign a higher "score" for a match on name than on e-mail, and sort on this column. Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-812654 Share on other sites More sharing options...
kickstart Posted April 18, 2009 Author Share Posted April 18, 2009 Hi I am intrigued. How do you mean to do this? All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-813310 Share on other sites More sharing options...
kickstart Posted April 20, 2009 Author Share Posted April 20, 2009 Hi Inspired by you suggestion I came up with this. SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, SUM( MatchType ) FROM ( SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 4 AS MatchType FROM loanee_table WHERE Active =1 AND upper( Name ) LIKE '%TEST%' UNION SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 2 AS MatchType FROM loanee_table WHERE Active =1 AND upper( Address ) LIKE '%TEST%' UNION SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, 1 AS MatchType FROM loanee_table WHERE Active =1 AND upper( EmailAddress ) LIKE '%TEST%' ) AS Deriv1 GROUP BY LoaneeId ORDER BY MatchType DESC , Name Not perfect but basically gives me what I want. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-814400 Share on other sites More sharing options...
fenway Posted April 21, 2009 Share Posted April 21, 2009 BTW, if you have a case-insensitive collation, you're just killing the index that way. I actually meant this: SELECT LoaneeId, Name, Address, PhoneNumber, EmailAddress, CASE WHEN upper( Name ) LIKE '%TEST%' THEN 4 WHEN upper( Address ) LIKE '%TEST%' THEN 2 WHEN upper( EmailAddress ) LIKE '%TEST%' THEN 1 ELSE 0 END ) AS score FROM loanee_table WHERE Active =1 AND ( upper( Name ) LIKE '%TEST%' OR upper( Address ) LIKE '%TEST%' OR upper( EmailAddress ) LIKE '%TEST%' ) ORDER BY score DESC Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-815765 Share on other sites More sharing options...
kickstart Posted April 22, 2009 Author Share Posted April 22, 2009 Hi Interesting. Not thought of doing it that way. Thanks. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-816208 Share on other sites More sharing options...
fenway Posted April 22, 2009 Share Posted April 22, 2009 So, solved? Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-816529 Share on other sites More sharing options...
kickstart Posted April 22, 2009 Author Share Posted April 22, 2009 Hi Probably, but still need to do some playing with it . Will mark it as solved when certain. Thanks. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-816547 Share on other sites More sharing options...
kickstart Posted April 22, 2009 Author Share Posted April 22, 2009 Hi Does seem to do it nicely. Only thing I wanted to have a play with was using "HAVING score > 0" (bit of abuse of the HAVING statement) instead of repeating the LIKE clauses in the WHERE. However that seems to slow things down. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-816574 Share on other sites More sharing options...
fenway Posted April 22, 2009 Share Posted April 22, 2009 Yes, that would VERY slow! Why does it matter if it's "repeated" Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-816582 Share on other sites More sharing options...
kickstart Posted April 22, 2009 Author Share Posted April 22, 2009 Hi Just wondering if mysql would otherwise evaluate the like statements twice when it isn't necessary. All the best Keith Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-816648 Share on other sites More sharing options...
fenway Posted April 22, 2009 Share Posted April 22, 2009 I'm hoping the optimizing handles this... even if it does, evaluating LIKE in the column list isn't expensive. Quote Link to comment https://forums.phpfreaks.com/topic/154354-solved-unique-records-on-a-union/#findComment-816667 Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.