Jump to content

Recommended Posts

I would like to build an app that can scan a web document, then pick out key words of that page to categorize it in my database.

 

Any suggestions on paths I should take to optimize something like this, and what I could do to pick out special words for categorization?

 

I was planning:

- Scan the page

- Save the url in a database

- Save the url's categories with a foreign key to the above in another database (since one url can have multiple categories)

- Save the web page info in another database for searching

 

 

Basically in the end when a user searches something there will be a database for categories, and the search will be annualized, and then the best table will be selected for searching for relevant results.

 

For example say the user searches for "Border Collie" It would then search in the "Animal" table for information about border collies since border collies are animals.

 

Example categories would be:

- programming

- animals

- games

- music

- people

- shopping

- food

- health

 

the list will probably be huge, so any help would be awesome!

Link to comment
https://forums.phpfreaks.com/topic/183539-categorization/
Share on other sites

This thread is more than a year old. Please don't revive it unless you have something important to add.

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.