etrader Posted January 3, 2011 Share Posted January 3, 2011 I'm looking for (1) a code to analyze the current page and show main words of the text, and (2) capturing similar words from the internet. Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/ Share on other sites More sharing options...
mikecampbell Posted January 3, 2011 Share Posted January 3, 2011 What do you mean by the "main words of the text"? Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154121 Share on other sites More sharing options...
etrader Posted January 3, 2011 Author Share Posted January 3, 2011 What do you mean by the "main words of the text"? I mean frequently repeated words. I consider SEO issues to capture tags. Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154122 Share on other sites More sharing options...
fortnox007 Posted January 3, 2011 Share Posted January 3, 2011 i am prettyy sure you don't mean frequently repeated words. But keywords otherwise you end-up with words like: " I are is we do etc " Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154124 Share on other sites More sharing options...
etrader Posted January 3, 2011 Author Share Posted January 3, 2011 i am prettyy sure you don't mean frequently repeated words. But keywords otherwise you end-up with words like: " I are is we do etc " You are quite right fortnox007. I assumed to avoid such common words. Exactly, I mean keywords. Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154125 Share on other sites More sharing options...
fortnox007 Posted January 3, 2011 Share Posted January 3, 2011 I am not sure if this can be done with an automated script really, because the script lacks the understanding of meaning. maybe ask your users to give some keywords in relation to the script. You could of course explode the whole string and count the stuff, but that would end you up with a lot of crap i think. I think this is something only humans can do. Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154131 Share on other sites More sharing options...
etrader Posted January 3, 2011 Author Share Posted January 3, 2011 What do you think of considering long words (e.g. with more than 8 characters), then all common words like "of" "the" will be omitted. It is not perfect, but at least it works any idea? Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154132 Share on other sites More sharing options...
Anti-Moronic Posted January 3, 2011 Share Posted January 3, 2011 What do you think of considering long words (e.g. with more than 8 characters), then all common words like "of" "the" will be omitted. It is not perfect, but at least it works any idea? I would create a wordlist of common trash words you don't need. Count the words, extract most common, filter out the trash words using our list. That should be easy enough. Even removing 1000 common 'trash' words would greatly reduce any text to 'key words'. I'd be interested to see if anyone has a better way. You might also consider looking at some 'key word' extraction apps in php. Search about, not sure if they exist and only one way to find out Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154152 Share on other sites More sharing options...
GalaxyTramp Posted January 3, 2011 Share Posted January 3, 2011 Hi This article may be of some help http://nadeausoftware.com/articles/2008/04/php_tip_how_extract_keywords_web_page GT Quote Link to comment https://forums.phpfreaks.com/topic/223242-related-words-captured-from-a-text/#findComment-1154161 Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.