Small Regex help

HubertCumperdale · April 20, 2012

I have to admit that regex is something that always sends me crosseyed.... one day I will get my head around it.

I have a small piece of code that is used to create hyperlinks for things like #Facebook from a Twitter RSS feed.

It works ok, however sometimes we get: #Facebook, or #Facebook: or #Facebook.

What I would like is to also remove these common end characters such a commas, colons and periods.

This is the code I currently have:

$description = preg_replace("/#([^ ]+)/", "<a href=\"https://twitter.com/#!/search/%23$1\">#$1</a>", $description);

Any help appreciated!

.josh · April 20, 2012

This will throw those 3 chars into the mix, to match up to. It says to match one or more of anything that is not a space, comma, pound or colon:

$description = preg_replace("/#([^ ,#:]+)/", "<a href=\"https://twitter.com/#!/search/%23$1\">#$1</a>", $description);

or alternatively, this will just match one or more word char (letter, number or underscore)

$description = preg_replace("/#(\w+)/", "<a href=\"https://twitter.com/#!/search/%23$1\">#$1</a>", $description);

HubertCumperdale · April 21, 2012

Used the first one, works great.... thanks!

HubertCumperdale · April 21, 2012

Actually, the second one is better... never know what char may be at the end of a word: . : ; ? - * etc.....

Recommended Posts