JonnoTheDev Posted October 29, 2009 Share Posted October 29, 2009 I don't understand what the difference is. Or would do. This is your requirement yes? I have an auction site that the client wants the search engine to be able to search for items within a given mileage radius. To find cities, towns, train stations, mcdonalds, whatever, within a given distance of a set point you use geo co-ordinates (latitude and longitude). It seems that you have this data in your database. So, once you have the lat, lon of your start point you can use what is called the haversine formula to find the nearest... You are using this in the code, yes, but you do not have to. Use in the actual query to obtain the results required. The query that you are currently using is highly inefficient. Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947029 Share on other sites More sharing options...
DavidAM Posted October 29, 2009 Share Posted October 29, 2009 it doesn't display also the originating ZIP You are specifically excluding the search zip code with these two lines: AND (latitude != $lat1 AND longitude != $lon1) since $lat1 and $lon1 are the values you retrieved from the database for the search zipcode, you are telling the query NOT to return that record. I'm not sure why it would be including 74133. Check the latitude and longitude for that record, make sure there is only one record for that zip, echo the sql and look to see what is causing it to be included. Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947066 Share on other sites More sharing options...
johnathanamber Posted October 29, 2009 Author Share Posted October 29, 2009 @David, OK, yes, that did fix it. Thank you for all your help. Will work on why it is showing the Tulsa ZIP. @Neil, So other than being inefficiant... would this cause any issues? Would it speed up performance? Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947235 Share on other sites More sharing options...
JonnoTheDev Posted October 29, 2009 Share Posted October 29, 2009 Would it speed up performance? absolutely Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947493 Share on other sites More sharing options...
johnathanamber Posted October 30, 2009 Author Share Posted October 30, 2009 @Neil, So what would it look like if you used this as a basis: $query = "SELECT id, title, shortdescription, description, picture, BIN_price, auction_type, start_date, end_date, hits, cat, auction_nr, nr_items_left, reserve_price, zipcode, weight, zip, city, state FROM jos_bid_auctions JOIN zip_codes ON zipcode = zip WHERE description OR shortdescription like '%$term%' AND(latitude <= $latN AND latitude >= $latS AND longitude <= $lonE AND longitude >= $lonW) ORDER BY zipcode"; } Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947596 Share on other sites More sharing options...
JonnoTheDev Posted October 30, 2009 Share Posted October 30, 2009 Look at my prev post on this page. It has an example Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947691 Share on other sites More sharing options...
DavidAM Posted October 30, 2009 Share Posted October 30, 2009 @Neil I do not see how your query is more efficient than the one I submitted. In fact, it looks less efficient. Without the limitations on the latitude and longitude, this query will have to calculate the distance for every auction in the table. SELECT a.description, a.zipcode, z.city, z.state, SQRT(POW(69.1 * (z.latitude - ".$latitude."), 2) + POW(69.1 * (".$longitude." - z.longitude) * COS(latitude / 57.3) , 2)) AS distance FROM jos_bid_auctions a INNER JOIN zipcodes z ON (a.zipcode=z.zipcode) HAVING distance < ".$distance." ORDER BY distance ASC The query I submitted, on the other hand, can take advantage of indexes on both tables to reduce the number of rows to be processed. If Latitude and/or Longitude is indexed on the zipcode table, then that will limit the number of zipcodes to be checked. Once a zipcode is found that fits the limits, an index on zipcode in the aution table can be used to retrieve the record. SELECT A.description, A.zipcode, Z.city, Z.state FROM jos_bid_auctions A JOIN zipcodes Z ON A.zipcode = Z.zipcode WHERE A.description like '%auction%' AND Z.latitude BETWEEN 36.286982858078 AND 35.997537141922 AND Z.longitude BETWEEN -95.699526863467 AND -96.058611136533 ORDER BY Z.state, Z.city, Z.zipcode$ Granted, having the distance calculation in the query could be handy. But that could be done in PHP against the resultset. I'm not sure which process there might be more efficient, I'm not sure it would make much difference. And I will admit I do not know the internals of mySql; but I do know that a table scan looses to an indexed read in almost every case. Also, I have never used a float for an index, but I don't see how it could cause significant problems. In fact, I might personally be inclinded to make the lat and long DECIMAL columns with 5 or 6 digits on the right. @johnathan Use BETWEEN instead of <= AND >=. I think the server can make better use of the indexes since it will know both limits in one clause. At any rate, it should not be less efficient. Just keep in mind that BETWEEN is inclusive (both limits also satisfy the condition). Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947788 Share on other sites More sharing options...
JonnoTheDev Posted October 30, 2009 Share Posted October 30, 2009 In fact, it looks less efficient. Without the limitations on the latitude and longitude, this query will have to calculate the distance for every auction in the table. This is not an issue. Having indexes on latitude, longitude will be fast. Indexes should also exist for zipcode as there is a join on the field. The returned result set should not be large as you are limiting to a given distance. Granted, having the distance calculation in the query could be handy. But that could be done in PHP against the resultset. You should not use code to manipulate data that can be done from an initial query and return the correct data. More code, more parse time and more issues when it comes to updating the program. This is good practice for any language. In fact, I might personally be inclinded to make the lat and long DECIMAL Geo data is best stored as a float. I have sites running maps using these queries from millions of records without issue. Read up on geocoding with mysql. It is very efficient. Quote Link to comment https://forums.phpfreaks.com/topic/178374-new-search-engine/page/2/#findComment-947920 Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.