Jump to content
acebase

Need advice what to use to scrape Chinese website

Recommended Posts

I need to scrape a Chinese website. (I guess there is no difference between scraping a Chinese website and a normal one?)

It's my suppliers weblink. They have have told me to download images and text from their website profile link off 1688.com.

There is an API - but from what I've read, it's pants + my virus checked doesn't allow me to visit the API doc page.

What tool should I use?

I've got lots of experience coding - but master of none. Maybe I still fall into the newbie category. LOL.

I saw a link from an article... they gave these names:

Goutte
Simple HTML DOM
htmlSQL
cURL
Requests
HTTPful
Buzz
Guzzle

Which should I consider?

IMPORTANT: I need to download and then upload into my Woocommerce website.

I need images + variation details. I'll have a little file with translations, so if XYZ is found in Chinese, then it is replaced with this.

I thought I would mention the extra detail incase it was relevant to considering which scraping tool to pick.

Thanks!

 

Edited by acebase

Share this post


Link to post
Share on other sites

If they have an API that does what you need then that's going to be the best way to go. Even if it's not the greatest. Because scraping stuff correctly and accurately is difficult.

Disable your antivirus's web scanning part and (carefully) try viewing the docs again?

Share this post


Link to post
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.


×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.