ddrudik

Members

View Profile See their activity

Posts
78
Joined
October 29, 2008
Last visited
October 30, 2015

Content Type

All Activity

Profiles

Forums

Topics
Posts

Everything posted by ddrudik

Crawler Related Problem

ddrudik replied to amitswba's topic in Regex Help

Show what the resulting array should look like.
- December 2, 2008
- 2 replies
make this case INsensitive?

ddrudik replied to Lodius2000's topic in Regex Help

The vowel code seems simple enough, although to demonstrate how you might do this with regex functions alone: <?php $sourcestring="Dang this crap, I want to SHOOT somebody. But that would be dangerous!"; echo preg_replace('/(?<=\bCR)A(?=P\b)|(?<=\bSH)OO(?=T\b)/ie','preg_replace(\'/./\',\'*\',\'\0\')',$sourcestring); ?>
- December 1, 2008
- 6 replies
RegEx between two specific tags

ddrudik replied to btray77's topic in Regex Help

Must not be the same then. Consider providing the URL for testing.
- November 30, 2008
- 3 replies
RegEx between two specific tags

ddrudik replied to btray77's topic in Regex Help

<?php $sourcestring="your source string"; preg_match('~<table style="border-collapse: collapse;" id="table9" bordercolorlight="#FFFFFF" border="1" width="100%">(.*?)</table>~s',$sourcestring,$matches); echo "<pre>".print_r($matches,true); ?>
- November 30, 2008
- 3 replies
preg_match_all -> class="l">85.236.100.103 <img src="/i/launch.gif" title="Launc

ddrudik replied to scarhand's topic in Regex Help

Your code works for me, maybe your real-world source is different than your sample: <pre> <?php $content='class="l">85.236.100.103 <img src="/i/launch.gif" title="Launch"/></a></td><td>29060 class="l">85.236.100.103 <img src="/i/launch.gif" title="Launch"/></a></td><td>29060 class="l">85.236.100.103 <img src="/i/launch.gif" title="Launch"/></a></td><td>29060'; preg_match_all('|class="l">([0-9.]+) <img src="/i/launch.gif" title="Launch"/></a></td><td>([0-9]+)|', $content, $matches); echo htmlentities(print_r($matches,true)); ?> output: Array ( [0] => Array ( [0] => class="l">85.236.100.103 <img src="/i/launch.gif" title="Launch"/></a></td><td>29060 [1] => class="l">85.236.100.103 <img src="/i/launch.gif" title="Launch"/></a></td><td>29060 [2] => class="l">85.236.100.103 <img src="/i/launch.gif" title="Launch"/></a></td><td>29060 ) [1] => Array ( [0] => 85.236.100.103 [1] => 85.236.100.103 [2] => 85.236.100.103 ) [2] => Array ( [0] => 29060 [1] => 29060 [2] => 29060 ) )
- November 27, 2008
- 2 replies
multi <div></div> tags matching

ddrudik replied to mike42's topic in Regex Help

Please show what matches you want from that string.
- November 25, 2008
- 2 replies
[SOLVED] regex question.. is this possible?

ddrudik replied to obay's topic in Regex Help

You could also incorporate the height and width params into the regex, but with only your initial requirement: $html=preg_replace('~<object\s[^>]*/swflash.cab[^>]*><param\s[^>]*value=("[^"]*")(??!</object>).)*</object>~is','<a href=$1 style="display:block;width:400px;height:300px" id="player"></a>',$html);
- November 24, 2008
- 4 replies
String does not have a substring in it... In 1 regexp ;p.

ddrudik replied to corbin's topic in Regex Help

$html=preg_replace('#<a\s[^>]*\bhref="(?=(??!/|https?://)[^"])+)#i','$0/',$html); haystack: <a href="blah">bleh</a> With <a href="/blah">bleh</a> <a href="blah/">bleh</a> With <a href="/blah">bleh</a> <a href="http://blah">bleh</a> With <a href="https://blah">bleh</a> output: <a href="/blah">bleh</a> With <a href="/blah">bleh</a> <a href="/blah/">bleh</a> With <a href="/blah">bleh</a> <a href="http://blah">bleh</a> With <a href="https://blah">bleh</a>
- November 23, 2008
- 2 replies
Wrapping paragraph tags around text

ddrudik replied to jordanwb's topic in Regex Help

As for how to handle the other tags, consider similar questions here to handle those as well as what to do with mismatched tags etc.: http://regexadvice.com/forums/thread/45583.aspx http://regexadvice.com/forums/thread/46297.aspx
- November 21, 2008
- 6 replies
Wrapping paragraph tags around text

ddrudik replied to jordanwb's topic in Regex Help

Usually CMS templating systems such as this evolve into something more complex than the example, do you have plans to support [img...] [url...] tags etc.? For your original question: $str=preg_replace('/^[^\r\n]+/m','<p>$0</p>',$str);
- November 21, 2008
- 6 replies
[code] block and tab characters

ddrudik replied to ddrudik's topic in PHPFreaks.com Website Feedback

I was reading the 'Preview' as being the same as the actual post afterwards. The preview window shows them removed. The string below in regex notation appears as: ^\t\t$result['level']=2;$ $result['level']=2; In the Preview it appears in a code block without indent.
- November 20, 2008
- 2 replies
[code] block and tab characters

ddrudik posted a topic in PHPFreaks.com Website Feedback

Tab character seem to fail to indent when displayed within [ code ] blocks here but if I post the code without a [ code ] block the code displays indented without issue. The color markup of the code is all well and good but if it is going to mess with the indent I don't see the benefit. Is there something I am missing? example: function parseline($line){ if(substr($line,178,1)=='2'){ function parseline($line){ if(substr($line,178,1)=='2'){
- November 20, 2008
- 2 replies
trace a line in a text file

ddrudik replied to homer.favenir's topic in Regex Help

This is actually a string question and not a regex question. Here's the code required to parse the lines into fields within the files, I will leave it to you to work out the specifics on how you want to compare what. The code and array output of the example (shown with file1 but file2 is parsed with the same code) should give you a start in the right direction. <pre> <?php function parseline($line){ if(substr($line,178,1)=='2'){ $result['level']=2; $result['indent']=substr($line,0,1); $result['address']=substr($line,1,78); $result['city']=substr($line,79,30); $result['state']=substr($line,110,2); $result['zip']=substr($line,113,5); $result['tollfree']=substr($line,123,40); $result['areacode']=substr($line,164,3); $result['prefix']=substr($line,168,3); $result['suffix']=substr($line,172,4); } elseif(substr($line,178,1)=='1') { $result['level']=1; $result['page']=substr($line,0,4); $result['type']=substr($line,5,1); $result['name']=substr($line,7,78); } else { return false; } return array_map('trim',$result); } $lines=file('file1.txt'); foreach($lines as $line){ $fields=parseline($line); if($fields){ echo "<hr>line:<br>$line<br>\$fields "; echo print_r($fields,true); } } ?>
- November 20, 2008
- 12 replies
trace a line in a text file

ddrudik replied to homer.favenir's topic in Regex Help

To do that level of detail comparison each line would need to be broken out into fields by a different method than what I used. Your file1 uses \r\n as line separators while file2 only uses \n so that was throwing off my code. My code compares both lines of a record together so it's output is different than from your program. <pre> <?php function showdiff($f1,$f2){ $file1=preg_replace('/ +/',' ',preg_replace('/(.*?)\S+(?=\r\n)/','$1',file_get_contents('file1.txt'))); $file2=preg_replace('/ +/',' ',preg_replace('/(.*?)\S+(?=\r\n)/','$1',preg_replace('/\n/',"\r\n",file_get_contents('file2.txt')))); $f1count=preg_match_all('/(?:.*?\r\n){2}/',$file1,$f1matches); $f2count=preg_match_all('/(?:.*?\r\n){2}/',$file2,$f2matches); foreach($f2matches[0] as $f2line){ if(!in_array($f2line,$f1matches[0])){ echo "missing <font color=red>$f2line</font> from file 1.<br>"; } } foreach($f1matches[0] as $f1line){ if(!in_array($f1line,$f2matches[0])){ echo "missing <font color=red>$f1line</font> from file 2.<br>"; } } } showdiff('file1','file2'); ?>
- November 20, 2008
- 12 replies
trace a line in a text file

ddrudik replied to homer.favenir's topic in Regex Help

Please show what specific output you expect when comparing the two file samples as shown in your original question.
- November 20, 2008
- 12 replies
trace a line in a text file

ddrudik replied to homer.favenir's topic in Regex Help

They were compared between the two files as two-line records (name etc on line 1 and address etc on line 2). preg_match_all put them into into an array per file and then I compared them between the two arrays.
- November 20, 2008
- 12 replies
trace a line in a text file

ddrudik replied to homer.favenir's topic in Regex Help

See if this works for your requirements: <pre> <?php function showdiff($f1,$f2){ $file1=preg_replace('/ +/',' ',preg_replace('/(.*?)\S+(?=\r\n|$)/','$1',file_get_contents('file1.txt'))); $file2=preg_replace('/ +/',' ',preg_replace('/(.*?)\S+(?=\r\n|$)/','$1',file_get_contents('file2.txt'))); preg_match_all('/.*?\r\n.*?(?:\r\n|$)/',$file1,$f1matches); preg_match_all('/.*?\r\n.*?(?:\r\n|$)/',$file2,$f2matches); foreach($f2matches[0] as $f2line){ if(!in_array($f2line,$f1matches[0])){ echo "missing <font color=red>$f2line</font> from file 1.<br>"; } } foreach($f1matches[0] as $f1line){ if(!in_array($f1line,$f2matches[0])){ echo "missing <font color=red>$f1line</font> from file 2.<br>"; } } } showdiff('file1.txt','file2.txt'); ?> output: missing 0003 R ALSTON BRANDY N 310 OBERLIN RD 326-0369 from file 1. The spacing is different between the two files so multiple spaces have been reduced to 1 space and the last column which also differs between the two files is ignored. The comparison is irrespective of location in the file, it is a success if a given 2-line record in file1 is located anywhere in file2 and vice versa. Your last entry in both files was excluded from my testing since they included only line 1 and not line 2, making a comparison of both not possible.
- November 20, 2008
- 12 replies
[SOLVED] HELP: nested tags?

ddrudik replied to mab's topic in Regex Help

A more common way of seeing that is with (?R) instead of (?0) although (?0) helps to illustrate that you could incorporate lookahead and lookbehind and use a capture group 1 as the nested pattern (?1). A complete background is in Friedl's "Mastering Regular Expressions" but for a quick PHP regex syntax overview: http://us3.php.net/manual/en/reference.pcre.pattern.syntax.php Search for "Recursive Patterns" on that page and you will see the discussion of the general pattern, although instead their example matches nested/non-nested parens groups. It is simpler to construct a pattern with a single bounding character such as ( ) versus the table tags but the theory is the same.
- November 19, 2008
- 5 replies
trace a line in a text file

ddrudik replied to homer.favenir's topic in Regex Help

The text files differ quite a bit, 1/2 vs. V1/V2 in the last column etc, is the last column to be ignored from the comparison? Are your real-world text files to be compared line by line exactly or every 2 lines as the records appear to be 2 lines long? Is the order important or can the records appear anywhere in the text file to be considered valid? What about records in file1 that don't appear in file2 (if that should ever occur)?
- November 19, 2008
- 12 replies
Matching place holders starting/ending with {{ }}

ddrudik replied to brick's topic in Regex Help

Is it sufficient to match all {{...}} blocks? If so: <?php $sourcestring="{{param1:123_xyz^param2:xyx^param3:123}}"; preg_match_all('/\{\{.*?\}\}/s',$sourcestring,$matches); echo "<pre>".print_r($matches,true); ?> $matches Array: ( [0] => Array ( [0] => {{param1:123_xyz^param2:xyx^param3:123}} ) )
- November 19, 2008
- 1 reply
[SOLVED] regex... formatting to [0-9]2-[0-2]2-[0-9]4 or similar

ddrudik replied to blueman378's topic in Regex Help

Probably should bound that with ^ and $ to test entire string: <?php $str = "15-03-1991"; echo preg_match('/^\d{2}-\d{2}-\d{4}$/', $str) ? "Good" : "Bad" ; ?>
- November 19, 2008
- 2 replies
[SOLVED] HELP: nested tags?

ddrudik replied to mab's topic in Regex Help

That last code brings up a good point, for every regex function there's a string function that can do the same operation faster and with less overhead.
- November 19, 2008
- 5 replies
[SOLVED] HELP: nested tags?

ddrudik replied to mab's topic in Regex Help

<?php $html='<h1>Some tables and text</h1> <table> <tr> <th>England</th> <th>Paris</th> <th>Munich</th> </tr> <tr> <td> <table> <tr> <th>London</th> <th>Brighton</th> <th>Cambridge</th> </tr> <tr> <td>rain</td> <td>sun</td> <td>wind</td> </tr> </table> </td> <td>sun</td> <td>wind</td> </tr> </table> This is a text between the tables wich should not be removed. <table> <tr> <th>London</th> <th>Paris</th> <th>Munich</th> </tr> <tr> <td>rain</td> <td>sun</td> <td>wind</td> </tr> </table>'; $html=preg_replace('~<table[^>]*>(??>(??!</?table[^>]*>).)+)|(?0))*</table>~is','',$html); echo $html; ?>
- November 18, 2008
- 5 replies
[SOLVED] using reg expr, how do i wrap multiple <a> and </a> around multiple <img /> ?

ddrudik replied to obay's topic in Regex Help

Those were just various code examples for the task at hand. For your last question: $newStr = preg_replace('#<img src=("[^"]*") [^>]+>#', '<a href=$1>$0</a>', $str);
- November 18, 2008
- 6 replies
[SOLVED] another preg_match_all() problem

ddrudik replied to ted_chou12's topic in Regex Help

If the can be any non-tag text: preg_match_all('~<p class="pagination" align=center>[^<]*<a [^>]*>([^<]*)</a>[^<]*<a [^>]*>([^<]*)~',$sourcestring,$matches); If the can be any non-a href tag text (less preferred): preg_match_all('~<p class="pagination" align=center>.*?<a [^>]*>([^<]*)</a>[^<]*<a [^>]*>([^<]*)~s',$sourcestring,$matches);
- November 17, 2008
- 7 replies

Sign In

ddrudik

Posts

Joined

Last visited

Content Type

Profiles

Forums

Everything posted by ddrudik

Crawler Related Problem

make this case INsensitive?

RegEx between two specific tags

RegEx between two specific tags

preg_match_all -> class="l">85.236.100.103 <img src="/i/launch.gif" title="Launc

multi <div></div> tags matching

[SOLVED] regex question.. is this possible?

String does not have a substring in it... In 1 regexp ;p.

Wrapping paragraph tags around text

Wrapping paragraph tags around text

[code] block and tab characters

[code] block and tab characters

trace a line in a text file

trace a line in a text file

trace a line in a text file

trace a line in a text file

trace a line in a text file

[SOLVED] HELP: nested tags?

trace a line in a text file

Matching place holders starting/ending with {{ }}

[SOLVED] regex... formatting to [0-9]2-[0-2]2-[0-9]4 or similar

[SOLVED] HELP: nested tags?

[SOLVED] HELP: nested tags?

[SOLVED] using reg expr, how do i wrap multiple <a> and </a> around multiple <img /> ?

[SOLVED] another preg_match_all() problem

Browse

Activity

Important Information