[SOLVED] searching

-entropyman · July 20, 2008

Hello,

need someone's help again. i'm attempting to write a script that searches a .htm page for a specific number. i want to find the line that $number is on and return only that line. here's what i've got:

$image = 'http://xx.x.xx.xx/######.png';
        $server = 'http://xx.x.xx.xx/xxxxxxx.htm'
$number = basename($image, ".png");
$ns = file_get_contents($server);
if (preg_match ($number, $ns))
{
	echo "match was found";
}
else
{
	echo "failure!";
}

now i'm pretty sure my preg match line is wrong but i'm not sure how to fix it.

after i fix the preg match i'm getting rid of that if but i'm not sure how to return the line it's on.

any ideas?

and the htm page is setup like this

00001 testtestest test testwe
00002 testtestest test testwew
00003 testtestest test testrrrrrrr
00004 testtestest test testeee
00005 testtestest test testrrrrrr

any ideas?

???
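
On the preg_match line: a PCRE pattern needs delimiters, so passing the bare number as the pattern will not work as written. Below is a minimal sketch of one way to get the whole line back, assuming the page looks like the sample above. The helper name find_line_regex and the inline sample string are made up for illustration; the real script would pass in the result of file_get_contents($server).

```php
<?php
// Hypothetical helper: returns the first line of $text that contains
// $number, or null when nothing matches.
function find_line_regex($text, $number)
{
    // preg_quote() escapes regex metacharacters; '/' is the delimiter.
    // The m flag makes ^ and $ match at line boundaries, so the pattern
    // grabs the whole line around the number.
    $pattern = '/^.*' . preg_quote($number, '/') . '.*$/m';
    if (preg_match($pattern, $text, $m)) {
        // Drop a trailing "\r" in case the page uses Windows line endings.
        return rtrim($m[0], "\r");
    }
    return null;
}

// Sample data copied from the post above (stands in for the remote .htm).
$sample = "00001 testtestest test testwe\n"
        . "00002 testtestest test testwew\n"
        . "00003 testtestest test testrrrrrrr\n";

echo find_line_regex($sample, '00003'), "\n";
```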

cooldude832 · July 20, 2008

the way I'd do it would be

<?php
$image = 'http://xx.x.xx.xx/######.png';
$server = 'http://xx.x.xx.xx/xxxxxxx.htm'
$number = basename($image, ".png");
$content = file_get_contents($server);
$data = explode("\n",$contnet);
$matched = array();
foreach($data as $key=>$value){
if(stristr($value,$number)){
$matched[] = $key;
}
}
print_r($matched);
?>

a regular expression could be more useful if you knew it was in a <img> tag for example
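
If the number did sit inside an <img> tag, a regular expression could pull it out directly. Here is a rough sketch under that assumption. The markup in $html and the helper name image_numbers are invented for illustration, since the thread never shows such a page.

```php
<?php
// Hypothetical helper: collects the numeric file names of .png images
// referenced by <img> tags in $html.
function image_numbers($html)
{
    // Capture the digits between the last '/' of the src and '.png'.
    $pattern = '/<img\b[^>]*\bsrc=["\'][^"\']*\/(\d+)\.png["\']/i';
    preg_match_all($pattern, $html, $matches);
    return $matches[1];
}

// Made-up markup, only for demonstration.
$html = '<p><img src="http://example.invalid/00003.png" alt=""></p>';
print_r(image_numbers($html));
```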

jonsjava · July 20, 2008

mine is more lengthy, but here it is:

<?php
$image = 'http://xx.x.xx.xx/######.png';
$server = 'http://xx.x.xx.xx/xxxxxxx.htm';
$number = basename($image, ".png");
$ns = file_get_contents($server);
$data_array = explode("\n", $ns);
$result_array = array();
$count = 0;
foreach ($data_array as $value){
    // strstr() returns false when $number is not on the line
    if (strstr($value, $number) !== false){
        $result_array[] = $value;
        $count++;
    }
}
// comparison (==), not assignment (=)
if ($count == 0){
    print "failed to find number";
}
else {
    print "Found $count results. Results as follows:\n<br />";
    foreach ($result_array as $value){
        print $value."\n<br />";
    }
}
?>

cooldude832 · July 20, 2008

it's the same exact thing with a bit of fancy error reporting/outputting

jonsjava · July 20, 2008

I didn't steal your script. I had written it, and hate to waste code. I noticed your response after I completed the script.

-entropyman · July 20, 2008

wow i love you two

thank you!

cooldude832 · July 20, 2008

A note for Jonsjava is you don't need that variable $count because you can simply count the array and say

<?php
if(count($result_array) == 0){
#no results
}
?>

jonsjava · July 20, 2008

all too true. also noticed how similar our scripts are. freaky. GET OUT OF MY HEAD *lol*

.josh · July 20, 2008

cooldude you explode everything into a giant array and then run a loop on each "word" and check if it matches $number and if it does it saves the array key for that giant array so your code will just return for example (in an array) "0 8 12 ..."

jonsjava you also explode everything into a giant array and then run a loop on each "word" and check if it matches $number and if it does, it saves the array value from the position in that giant array so your code will just return for example (in an array) "00003 00003 00003 00003"

-entropy: Assuming that the point of finding the line is to get the data on the line and separate it for use, here's my take:

<?php
  $image = 'http://xx.x.xx.xx/######.png';
  $server = 'http://xx.x.xx.xx/xxxxxxx.htm';
  $number = basename($image, ".png");

  $row = array();
  $list = file($server);
  foreach ($list as $key => $val) {
     if (stristr($val, $number)) {
        // accommodates if there's more than 1 row
        // split the line on spaces (assumed delimiter, going by the sample page)
        $row[] = explode(' ', trim($val));
     } // end if $number in $val
  } // end foreach $list
?>

edit: oops, my bad, both of you. For some reason I read \n as a space. Well, anyway, you can skip the explode part by using file() instead of file_get_contents()

jonsjava · July 20, 2008

um...mine goes line-by-line. not word-by-word. "\n"

.josh · July 20, 2008

edit: oops, my bad, both of you. For some reason I read \n as a space. Well, anyway, you can skip the explode part by using file() instead of file_get_contents()

I edited my previous post once I noticed that, sorry.

cooldude832 · July 20, 2008

using file() and using

$var = file_get_contents($server);
$data = explode("\n", $var);

produce the same array, so I don't see your method saving resources.

Our method is actually better, assuming you find a match, because you then have the data broken into lines, so you can directly say

foreach ($matched as $value) {
    echo $data[$value];
}

The point is to find the matches. How the output gets formatted is up to the end user; we just point out how to do it simply (most people asking a question know how to format output in a way they like).

.josh · July 20, 2008

file breaks the file into an array line by line so you don't have to do the explode. The end result is the same it's just 1 less line of code.
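
One detail worth knowing when comparing the two: by default file() keeps the line ending on every element, and explode() leaves an empty last element when the text ends with a newline. The sketch below shows how to make them line up, using a temporary file in place of the remote page (the file contents are made up).

```php
<?php
// Write a small sample to a temp file so the comparison is reproducible.
$path = tempnam(sys_get_temp_dir(), 'lines');
file_put_contents($path, "00001 one\n00002 two\n");

// FILE_IGNORE_NEW_LINES strips the line ending from each element.
$viaFile = file($path, FILE_IGNORE_NEW_LINES);

// The text ends with "\n", so trim it first to avoid an empty last element.
$raw = file_get_contents($path);
$viaExplode = explode("\n", rtrim($raw, "\n"));

var_dump($viaFile === $viaExplode); // bool(true)
unlink($path);
```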

cooldude832 · July 20, 2008

I see it as an over treatment of a variable because in turn if the file I am working with is of reasonable size and I want to re apply conditions to the original file (say striptags) I can apply this to the unexploded file and then do additional treatment as needed.

I've done a lot of reverse engineering of formatted outputs back into database and in my experience using the file_get_contents combined with preconditioning and then exploding into lines (or on <td> tags as I find to be very common) the output is easier to work with.

I also can recall the entire file from a variable if needed to be inserted.

However, if I'm working on a grossly oversized file, then the file() method is better for saving resources. That said, a grossly oversized .htm file is hard to find, because in all reality it's a glorified text file.

.josh · July 20, 2008

True your method does preserve the file as a whole in case you need it I'll give you that one. Though, I can't really think of a situation off the top of my head where you'd actually need both. I mean, trying to parse one thing might be easier by the line, while trying to parse another might be better doing it as a whole, but you can accomplish any task from just one or the other, so in the end, having 2 copies of the same information loaded in memory is always gonna be harder on the computer than just executing another line of code.

cooldude832 · July 20, 2008

so in the end, having 2 copies of the same information loaded in memory is always gonna be harder on the computer than just executing another line of code.

You're talking aggregate on modern servers running 4-64 gigs of RAM, if the file is under 100kb, which 99% of HTML files are

If you don't believe me about 64 gig of ram on a server

(http://www.newegg.com/Product/Product.aspx?Item=N82E16813151008)

And yes, a 64-bit architecture can address all of it: 2^64 = 1.84467441 × 10^19 bytes, which is about 1.84 × 10^10 gigs of RAM (more than anyone needs)
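
For anyone checking the arithmetic, here is a quick sketch. The only assumption is the unit convention: decimal gigabytes of 10^9 bytes versus binary gibibytes of 2^30 bytes.

```php
<?php
// 2^64 addressable bytes; pow() returns a float here because the value
// does not fit in a signed 64-bit integer.
$bytes = pow(2, 64);

// Decimal gigabytes (10^9 bytes) versus binary gibibytes (2^30 bytes).
$gigabytes = $bytes / 1e9;
$gibibytes = $bytes / pow(2, 30);

printf("%.3e bytes, %.3e GB, %.3e GiB\n", $bytes, $gigabytes, $gibibytes);
```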

jonsjava · July 20, 2008

Oh, and FYI: This page (yes, this very page, page one, that is) comes in at around 11.5kb. this is a large page by most standards.

.josh · July 20, 2008

Well I do admit that in the end we're just nickel and diming here

-entropyman · July 20, 2008

as i reread this code i realize i forgot to tell you something very important. the reason i broke the url apart to look for the number is that the number won't be repeated in this htm file, so only one result will ever be returned. i am also going to take the returned line and manipulate it further, so i need to store the line in a variable? i'm not sure how to fix your code to do that. i'm still very new ^^

thank you again

cooldude832 · July 20, 2008

modifying my example a bit

<?php
$image = 'http://xx.x.xx.xx/######.png';
$server = 'http://xx.x.xx.xx/xxxxxxx.htm'
$number = basename($image, ".png");
$content = file_get_contents($server);
$data = explode("\n",$contnet);
foreach($data as $key=>$value){
if(stristr($value,$number)){
$matched = $key;
#watch it break
break;
}
}
#the row is now
$row = $data[$matached];
?>
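
Since the number appears only once, the loop can stop at the first hit and hand the line back in a variable. This is a minimal sketch of that idea, not a drop-in replacement. find_line is a made-up helper name, it uses strpos() rather than stristr() (case does not matter for digits), and the sample array stands in for explode("\n", file_get_contents($server)).

```php
<?php
// Hypothetical helper: returns the first line containing $number,
// or null if no line matches.
function find_line(array $lines, $number)
{
    foreach ($lines as $line) {
        // strpos() returns false on no match; 0 is a valid position,
        // so compare with !== false.
        if (strpos($line, $number) !== false) {
            return $line;
        }
    }
    return null;
}

// Stand-in for the lines of the remote page.
$data = array(
    '00001 testtestest test testwe',
    '00002 testtestest test testwew',
    '00003 testtestest test testrrrrrrr',
);

$row = find_line($data, '00002');
echo $row, "\n";
```

$row now holds the whole line and can be split further, for example with explode(' ', $row).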

jonsjava · July 20, 2008

beating a dead horse...

Well, it's good to see that people are observant tonight. I haven't seen such lively chatter about such a small memory/speed difference before. I gotta say that I like it.

-entropyman · July 20, 2008

hmm i can't seem to get that to work. just outputting a blank page. can't get crayon's to work or your original either. i can only get jonsjava to work.

any ideas?

???

cooldude832 · July 20, 2008

I spelled $content wrong in the beginning to see if u see it.

jonsjava · July 20, 2008

lol. nice save.

-entropyman · July 20, 2008

yes i fixed $content and $matched but still nothing. ideas?
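
One more thing worth checking: the $server = '...' line copied from the first post has no closing semicolon in cooldude832's examples, which is a parse error. With display_errors turned off, a parse error typically shows up as a blank page. jonsjava's version has the semicolon, which would fit with it being the only one that ran. Here is a sketch for surfacing such errors, assuming PHP 7 or later (where include throws ParseError); the temporary file stands in for the real script.

```php
<?php
// Show every error while debugging (not something to leave on in production).
error_reporting(E_ALL);
ini_set('display_errors', '1');

// Stand-in for the real script: note the missing ';' after the URL.
$script = tempnam(sys_get_temp_dir(), 'chk');
file_put_contents(
    $script,
    "<?php\n\$server = 'http://example.invalid/page.htm'\n\$number = 5;\n"
);

$error = null;
try {
    include $script;
} catch (ParseError $e) {
    // PHP 7+ reports syntax errors in included files as ParseError.
    $error = $e->getMessage() . ' on line ' . $e->getLine();
}
unlink($script);

echo $error === null ? "no syntax error\n" : "Parse error: $error\n";
```

From a shell, php -l yourscript.php runs the same kind of syntax check without executing anything.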
