Jump to content

Base64 encoded URLs


freeloader

Recommended Posts

Hi guys,

 

I'm parsing a bunch of html sources, some of them (a minority) contain base64 encoded URLs. At least 20-30 URLs per page. Some of the information I need is taken from the URLs.

 

I'm looking for a way to parse these URLs without losing too much time (20-30 URLs per page and I need to parse about 10k pages daily).

 

Should I regex each source individually for base64 encoded URLs and then decode them each time and isn't that going to take a lot of time/resources (especially considering it's only a minority that have the base64 URLs in them)? Or is there a better way to do it?

 

Code snippets are absolutely welcome!

 

Thank you in advance :)

Link to comment
https://forums.phpfreaks.com/topic/257801-base64-encoded-urls/
Share on other sites

Because these are pre-parsed pages and in some of them, the URLs have been parsed through a base64 encoder.

 

Giving an output like this:

<link href="Oigregregregrer/gergeggege==" media="screen" rel="stylesheet" type="text/css" />

 

(The above is not an actual link.)

Link to comment
https://forums.phpfreaks.com/topic/257801-base64-encoded-urls/#findComment-1321438
Share on other sites

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.