
Categorization


The Little Guy


I would like to build an app that can scan a web document and pick out keywords from that page to categorize it in my database.

 

Any suggestions on how to approach and optimize something like this, and on how to pick out distinctive words for categorization?
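One common starting point for picking out keywords is plain term frequency: strip the markup, drop very common words, and keep the most frequent terms that remain. The sketch below assumes that approach. The names `TextExtractor`, `extract_keywords`, and `STOP_WORDS` are hypothetical, and the stop-word list is far too short for real use.

```python
import re
from collections import Counter
from html.parser import HTMLParser

# Tiny illustrative stop-word list (an assumption); a real list would be much longer.
STOP_WORDS = {
    "the", "a", "an", "and", "or", "of", "to", "in", "is", "are",
    "for", "on", "with", "that", "this", "it", "as", "at", "by", "be",
}


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def extract_keywords(html, top_n=5):
    """Return up to top_n of the most frequent non-stop-words in the page text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    words = re.findall(r"[a-z]+", " ".join(parser.parts).lower())
    counts = Counter(w for w in words if w not in STOP_WORDS and len(w) > 2)
    return [word for word, _ in counts.most_common(top_n)]
```

Raw frequency tends to favor words that are common on every page, so weighting terms against the whole collection (TF-IDF, for example) would likely be the next refinement.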

 

I was planning:

- Scan the page

- Save the URL in a database

- Save the URL's categories with a foreign key to the above in another database (since one URL can have multiple categories)

- Save the web page info in another database for searching
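The plan above can be sketched as a small schema. This is only one possible layout, and it assumes the three stores are tables inside a single SQLite database so that the foreign keys can actually be enforced. The table names, column names, and the `save_page` helper are all hypothetical.

```python
import sqlite3

# Assumed layout: one row per URL, one row per (URL, category) pair,
# and one row of searchable page text per URL.
SCHEMA = """
CREATE TABLE urls (
    id  INTEGER PRIMARY KEY,
    url TEXT NOT NULL UNIQUE
);
CREATE TABLE url_categories (
    url_id   INTEGER NOT NULL REFERENCES urls(id),
    category TEXT NOT NULL,
    PRIMARY KEY (url_id, category)
);
CREATE TABLE page_content (
    url_id INTEGER PRIMARY KEY REFERENCES urls(id),
    body   TEXT NOT NULL
);
"""


def save_page(conn, url, categories, body):
    """Store a scanned page, its categories, and its text; return the URL id."""
    cur = conn.execute("INSERT INTO urls (url) VALUES (?)", (url,))
    url_id = cur.lastrowid
    conn.executemany(
        "INSERT INTO url_categories (url_id, category) VALUES (?, ?)",
        [(url_id, category) for category in categories],
    )
    conn.execute(
        "INSERT INTO page_content (url_id, body) VALUES (?, ?)",
        (url_id, body),
    )
    conn.commit()
    return url_id


conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.executescript(SCHEMA)
```

The composite primary key on `url_categories` lets one URL carry several categories while preventing the same category from being attached twice.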

 

 

Basically, in the end, when a user searches for something, there will be a database of categories; the search will be analyzed, and then the best table will be selected to search for relevant results.

 

For example, say the user searches for "Border Collie". It would then search the "Animal" table for information about border collies, since border collies are animals.
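The routing step in that example could look roughly like this: each word of the query votes for a category, and the category with the most votes decides where to search. The `KEYWORD_CATEGORIES` index and the `pick_category` function are hypothetical. In practice the index would be built from the pages that have already been categorized, not hard-coded.

```python
from collections import Counter

# Hypothetical keyword-to-category index, hard-coded here for illustration only.
KEYWORD_CATEGORIES = {
    "border": "animals",
    "collie": "animals",
    "dog": "animals",
    "python": "programming",
    "guitar": "music",
    "recipe": "food",
}


def pick_category(query, default=None):
    """Return the category that most query words map to, or default if none match."""
    votes = Counter()
    for word in query.lower().split():
        category = KEYWORD_CATEGORIES.get(word)
        if category is not None:
            votes[category] += 1
    if not votes:
        return default
    return votes.most_common(1)[0][0]
```

Here `pick_category("Border Collie")` resolves to `"animals"` because both words map to that category. If the result is used to choose a table, it is safer to look the name up in a fixed list of known tables than to build SQL from the search text.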

 

Example categories would be:

- programming

- animals

- games

- music

- people

- shopping

- food

- health

 

The list will probably be huge, so any help would be awesome!

