effigy's Content - Page 3

[SOLVED] preg_replace help

effigy replied to Brian W's topic in Regex Help

Character classes only match one character. You must add a quantifier to match more. The replacement needs to be evaluated in order to act on the matched data. preg_replace('/("[^"]+")/e', 'str_replace(" ", "_", "$1")', $string);

December 30, 2008
3 replies

[SOLVED] How can I strip "$5,000" to just "5000"?

effigy replied to virtuexru's topic in PHP Coding Help

Store and work with the number in its digit form, using number_format for its display.

December 29, 2008
6 replies

Weird Chars

effigy replied to tobeyt23's topic in PHP Coding Help

I get three pages worth; two if the phrase is quoted.

December 23, 2008
8 replies

Weird Chars

effigy replied to tobeyt23's topic in PHP Coding Help

Since the diamond is preceding an "s," I assume you've got "smart quotes" on your hands. Search the forum for this.

December 23, 2008
8 replies

string search and replace

effigy replied to gevans's topic in Regex Help

Actually, it needs another tweak in case the attributes are not quoted: %<a href=([\'"])?((??!\1)[^>\s])+\.pdf)(?(1)\1)>(.*?)</a>%si The first non-literal part of the regex looks for a single or double quote, which may not exist at all. Afterwards, it captures one character (that is not whitespace or ">") at a time, but only if it does not encounter the (optional) quote that it began with. In other words, if a single quote was found, match all of its contents up to the ending single quote; the same goes if a double quote was matched. If nothing was found, it stops at the end of the tag. It then backtracks to make sure the URL ends with ".pdf", matches the ending quote if one was found, the end of the tag, the rest of the content up to "</a>", and then "</a>" itself. Keep in mind that this regex only works if no other attributes are present and the formatting is exact. Here's a technical breakdown: NODE EXPLANATION ---------------------------------------------------------------------- <a href= '<a href=' ---------------------------------------------------------------------- ( group and capture to \1 (optional (matching the most amount possible)): ---------------------------------------------------------------------- ['"] any character of: ''', '"' ---------------------------------------------------------------------- )? end of \1 (NOTE: because you're using a quantifier on this capture, only the LAST repetition of the captured pattern will be stored in \1) ---------------------------------------------------------------------- ( group and capture to \2: ---------------------------------------------------------------------- (?: group, but do not capture (1 or more times (matching the most amount possible)): ---------------------------------------------------------------------- (?! look ahead to see if there is not: ---------------------------------------------------------------------- \1 what was matched by capture \1 ---------------------------------------------------------------------- ) end of look-ahead ---------------------------------------------------------------------- [^>\s] any character except: '>', whitespace (\n, \r, \t, \f, and " ") ---------------------------------------------------------------------- )+ end of grouping ---------------------------------------------------------------------- \. '.' ---------------------------------------------------------------------- pdf 'pdf' ---------------------------------------------------------------------- ) end of \2 ---------------------------------------------------------------------- (?(1) if back-reference \1 matched, then: ---------------------------------------------------------------------- \1 what was matched by capture \1 ---------------------------------------------------------------------- | else: ---------------------------------------------------------------------- succeed ---------------------------------------------------------------------- ) end of conditional on \1 ---------------------------------------------------------------------- > '>' ---------------------------------------------------------------------- ( group and capture to \3: ---------------------------------------------------------------------- .*? any character (0 or more times (matching the least amount possible)) ---------------------------------------------------------------------- ) end of \3 ---------------------------------------------------------------------- </a> '</a>' ----------------------------------------------------------------------

December 23, 2008
10 replies

string search and replace

effigy replied to gevans's topic in Regex Help

%<a href=([\'"])?((??!\1).)+\.pdf)(?(1)\1)>(.*?)</a>%si

December 23, 2008
10 replies

string search and replace

effigy replied to gevans's topic in Regex Help

<pre> <?php $html = 'this is some stuff <a href="http://www.mydomain.com/dir/thefile.pdf">Read More</a> for updating the <a href="http://youdomain.com/another.pdf">Other Stuff</a>html'; $replace = <<<REPLACE <div class="pdf"> <a href="$2" target="_blank" title="$3"> <img class="left" src="images/pdf_download.png" alt="Download PDF" width="64" height="74" /> </a> <span class="title">$3</span> <span class="info">download pdf</span> <a href="$2" target="_blank" title="$3" class="link">DOWNLOAD</a> </div> <div class="pdf-bot2"></div> REPLACE; $html = preg_replace( '%<a href=([\'"])?((?(1).+?|[^\s>]+)\.pdf)(?(1)\1)>(.*?)</a>%si', $replace, $html ); echo htmlspecialchars($html); ?> </pre>

December 23, 2008
10 replies

preg_match_all help

effigy replied to dtdetu's topic in Regex Help

You need the s modifier so that . will also match new lines: preg_match_all('~<td class="trow1">(.*?)</td>~is', $searchresult, $topics);

December 23, 2008
1 reply

preg_replace with quoting

effigy replied to Guernica's topic in Regex Help

Try this.

December 22, 2008
4 replies

Match only when something is not there

effigy replied to Sildhe's topic in Regex Help

Actually, this is correct. I crossed my wires on the lazy/greedy portion, while the real issue is using [^>]*? rather than .*? (or with +, doesn't matter). My apologies. The difference between the lazy/greedy approach depends, as the book says, on the data.

December 22, 2008
16 replies

Match only when something is not there

effigy replied to Sildhe's topic in Regex Help

<pre> <?php $string = <<<STR <a href='blah' id=1234.2>[FLAG] something</a> <a href='blah' id=829.1>somethingelse</a> <a href='blah' id=634.5>somerandomcharlength</a> STR; preg_match_all('%<a[^>]+id=([\d.]+)[^>]*>(?!\[FLAG\]\s)%si', $string, $matches); array_shift($matches); print_r($matches); ?> </pre>

December 22, 2008
16 replies

Match only when something is not there

effigy replied to Sildhe's topic in Regex Help

The concern isn't of id= being outside of a tag, but of a tag not having id=. In this instance the regex would keep consuming data--going outside of the tag and running into another, possibly not even an a--until it finds id=. Arguably, the data in question may always have id= in the a; however, (1) data may change; and (2) [^>]* will work in both cases. Additionally, according to Mastering Regular Expressions:

December 22, 2008
16 replies

[SOLVED] preg_match help

effigy replied to dtdetu's topic in Regex Help

So what do you want? What you posted is the latest news entry.

December 22, 2008
6 replies

[SOLVED] preg_match help

effigy replied to dtdetu's topic in Regex Help

%</small>\s*</p>\s*<p>(.*?)</p>%si

December 22, 2008
6 replies

Match only when something is not there

effigy replied to Sildhe's topic in Regex Help

When you're working with a known format--e.g., HTML tags begin with "<" and end with ">"--conform to these rules in your pattern: don't use <a.*?...> but <a[^>]*...>. Not only is the greediness optimal, but safer, ensuring that you stay within your tag boundary.

December 22, 2008
16 replies

preg_replace & function

effigy replied to jaymc's topic in PHP Coding Help

Look into the /e modifier; this will let you pass $1 after it is defined by the regular expression, rather than having it evaluated beforehand.

December 19, 2008
13 replies

Character encoding issue

effigy replied to Jabop's topic in PHP Coding Help

Have you tried html_entity_decode? I would compare the strings in this fashion since you wouldn't run across an instance of comparing a named entity to a numerical.

December 17, 2008
7 replies

[SOLVED] Regex to find and append after HTML FORM Tag

effigy replied to zacware's topic in Regex Help

I would use /(<form[^>]+action="process.php"[^>]*>)/ since greediness is faster. It also constrains the pattern within tag boundaries just in case you encounter some bad HTML.

December 17, 2008
3 replies

[SOLVED] Regular Expression Trouble

effigy replied to carrotcake1029's topic in Regex Help

<pre> <?php $html = 'http://www.google.com<br>Go there for a cool search engine!'; ### Similar to strip_tags, but replace with a space. $html = preg_replace('/<[^>]*>/', ' ', $html); preg_match('%https?://\S+(?<!\p{P})%i', $html, $matches); print_r($matches); ?> </pre>

December 17, 2008
22 replies

Matching whole words in Unicode

effigy replied to lwc's topic in Regex Help

Well, what I was really after is: are you using any HTML or XML tools for this? Typically these will handle entities, isolation of content, etc.

December 17, 2008
18 replies

[SOLVED] Regular Expression Trouble

effigy replied to carrotcake1029's topic in Regex Help

That data works in the example code: <pre> <?php $html = <<<HTML http://www.google.com Go there for a cool search engine! HTML; preg_match('%https?://\S+(?<!\p{P})%i', strip_tags($html), $matches); print_r($matches); ?> </pre> What else is happening in your code?

December 16, 2008
22 replies

[SOLVED] Regular Expression Trouble

effigy replied to carrotcake1029's topic in Regex Help

How about something like this? <pre> <?php $html = <<<HTML <a href="http://www.phpfreaks.com">PHP Freaks</a> <a href="http://www.google.com/index.html">Visit http://www.google.com!</a> HTML; preg_match('%https?://\S+(?<!\p{P})%i', strip_tags($html), $matches); print_r($matches); ?> </pre>

December 16, 2008
22 replies

[SOLVED] Regular Expression Trouble

effigy replied to carrotcake1029's topic in Regex Help

Do you want to pull URLs from tags, content, or both?

December 16, 2008
22 replies

[SOLVED] Regular Expression Trouble

effigy replied to carrotcake1029's topic in Regex Help

What is the format of these entries? HTML? Prose? Anything?

December 16, 2008
22 replies

[SOLVED] Regular Expression Trouble

effigy replied to carrotcake1029's topic in Regex Help

%https?://[^\"\s>]+%i Will the URLs always be double quoted?

December 16, 2008
22 replies

Sign In

effigy

Posts

Joined

Last visited

Content Type

Profiles

Forums

Everything posted by effigy

[SOLVED] preg_replace help

[SOLVED] How can I strip "$5,000" to just "5000"?

Weird Chars

Weird Chars

string search and replace

string search and replace

string search and replace

preg_match_all help

preg_replace with quoting

Match only when something is not there

Match only when something is not there

Match only when something is not there

[SOLVED] preg_match help

[SOLVED] preg_match help

Match only when something is not there

preg_replace & function

Character encoding issue

[SOLVED] Regex to find and append after HTML FORM Tag

[SOLVED] Regular Expression Trouble

Matching whole words in Unicode

[SOLVED] Regular Expression Trouble

[SOLVED] Regular Expression Trouble

[SOLVED] Regular Expression Trouble

[SOLVED] Regular Expression Trouble

[SOLVED] Regular Expression Trouble

Browse

Activity

Important Information