[SOLVED] Detecting if a string ends with another

jordanwb · August 24, 2008

What I want to do is find out if a file of a given file path ends with "Theme.php". Example:

./themes/FiveAreaTheme/FiveAreaTheme.php returns true

./themes/FiveAreaTheme/CssBuilder.php returns false

I know I'd do something along the lines of this:


<?php

$file_path = "/foo/bar";

if (eregi ("Theme.php", $file_path))
{
do_stuff ();
}

?>

ibechane · August 24, 2008

All strings end with a special invisible end character. The dollar-sign ($) is the regex symbol that represents this end character:

eregi("Theme.php$",$file_path)

Does this work?

ibechane · August 24, 2008

One other thing... it may be coincidence that your file name has a dot symbol (.) in it, but that is in fact a regex special character for "any character."

Therefore, your regex test as is will return true for FiveAreaTheme.php as well as FiveAreaThemeXphp or FiveAreaTheme!php. To escape the character, put a backslash before it:

eregi("Theme\.php$",$file_path)

jordanwb · August 24, 2008

Yep, that did it thanks.

nrg_alpha · August 25, 2008

I would consider using preg instead of ereg. As of PHP 6, ereg (well, POSIX in general) will no longer be supported. Here is an exerpt from a page on IBM's site:

http://www.ibm.com/developerworks/opensource/library/os-php-future/?ca=dgr-lnxw01PHP-Future

"The ereg extension, which supports Portable Operating System Interface (POSIX) regular expressions, is being removed from core PHP support. If you are using any of the POSIX regex functions, this change will affect you unless you include the ereg functionality. If you're using POSIX regex today, consider taking the time to update your regex functions to use the Perl-Compatible Regular Expression (PCRE) functions because they give you more features and perform better. "

Basicaly, if you still have ereg stuff in your webpages and your hosting provider updates to PHP6, you'll be in for a little surprise. Granted, I think many hosting providers would give a grace period prior to fully switching over.. but some may not..either way, make your site PHP6 proof as soon as possible so that there is no issues once servers everywhere adopt it.

Cheers,

NRG

jordanwb · August 25, 2008

All right so would what I use instead of ereg?

I saw a bit on that page about GD libraries. Is that the image libraries that do stuff like resizing?

nrg_alpha · August 25, 2008

All right so would what I use instead of ereg?

I saw a bit on that page about GD libraries. Is that the image libraries that do stuff like resizing?

GD is a graphics library.. this has nothing to do with regular expressions.. you should use preg (which is part of PCRE.. which stands for Perl Compatible Regular Expressions).

Scroll down the page in the link until you see the sub heading EREG..

So instead of using ereg such as in your initial example:

$file_path = "/foo/bar";
if (eregi ("Theme.php", $file_path))
{
do_stuff ();
}

You would use PCRE instead:

$file_path = "/foo/bar";
if (preg_match('#Theme\.php#i', $file_path))
{
do_stuff ();
}

You can read up on PCRE here:

http://us3.php.net/manual/en/book.pcre.php

You can read up on preg_match here:

http://us3.php.net/manual/en/function.preg-match.php

The preg setup is different than ereg (needing delimiters for example), but trust me, once you get the hang of preg (PCRE in general..), you won't look back.

This link can help you get started as well:

http://www.regular-expressions.info/index.html

Hope that helps.

Cheers,

NRG

jordanwb · August 25, 2008

You would use PCRE instead:

$file_path = "/foo/bar";
if (preg_match('#Theme\.php#i', $file_path))
{
do_stuff ();
}

Ok Thanks. I have three other eregi patterns that I would have to convert for preg and I don't have a clue where to start

nrg_alpha · August 25, 2008

I would start by examining those extra links at the bottom of my last post

But just to give you rough guidance..

ereg(), eregi() = preg_match()
ereg_replace(), ereg_replacei() = preg_replace()

Just start familiarizing yourself with preg_match and preg_replace functionality. Learn about the fact that you need delimiters inside your pattern like so:

preg_match('# pattern #' , ' replace ', string); // the # characters are the delimiters.. they can be anything so long as it matches.. '% ... %' for example..

Just read up on it and use some of the examples found in the php.net manual links.. you'll get it soon enough..then you'll realise how much better preg is vs ereg.

jordanwb · August 25, 2008

I found one regex for checking an email address:

^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.(?:[A-Z]{2}|com|org|net|gov|mil|biz|info|mobi|name|aero|jobs|museum)$

Is that for ereg or preg_match or does it matter? I also noticed that that regex won't work for .ca based email addresses. ::)

nrg_alpha · August 25, 2008

I found one regex for checking an email address:
^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.(?:[A-Z]{2}|com|org|net|gov|mil|biz|info|mobi|name|aero|jobs|museum)$
Is that for ereg or preg_match or does it matter? I also noticed that that regex won't work for .ca based email addresses.

My advise is to not use ereg (support for this will be dropped as of PHP 6) So I would stick with preg.. its more robust anyways. As for your pattern there, the reason it doesn't support .ca is that the ca portion is not included in final aspect of the pattern.. try:

^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.(?:[A-Z]{2}|com|ca|org|net|gov|mil|biz|info|mobi|name|aero|jobs|museum)$

Notice the ca right after the com? But a word of caution about email addresses.. if your expression is too harsh (too strict / rigid), then you might be missing out on some emails.. One code snippet I stumbled upon doesn't use any PCRE (at least, on the surface, it doesn't seem to..perhaps internally it does.. not sure).. here is what I recommend:

function validemail($email){
   $checkEmail = filter_var(filter_var($email, FILTER_SANITIZE_EMAIL),FILTER_VALIDATE_EMAIL);
   if($checkEmail === false){
      return false; // houtson, we have a problem...
   } else {
      return true; // houston, all systems go!
   }
}

The idea here is simple:

Say your email is in a variable called $_POST['email']... you pass this into the function, which in turn sanitises and validates using PHP's own built in functionalities. But this validation system is not strict.. it is much more 'relaxed'.. so yes, emails can contain formats that are not quite right.. but it is not too loose.. there are 'reasonable' limits. So it is kind of a not too strict, not too loose kind of thing (from what i read, this is the more recommended method.. that way, it lets in some perfectly good emails that would otherwise be blocked...case in point.. your pattern does not contain '.co.uk'.. so what if someone with that at the end of their email address tried to send you an email?) The system I showed allows for it.

Just a suggestion.

jordanwb · August 25, 2008

My advise is to not use ereg (support for this will be dropped as of PHP 6)

Yeah I know.

What I meant by "does it matter?" is that patterns for ereg work with preg_match automatically?

nrg_alpha · August 25, 2008

I beleive so. To my understanding, preg has pretty much everything ereg has and more..So you can plug your pattern into a preg_match() and try it out.

jordanwb · August 25, 2008


$pattern = "^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.(?:[A-Z]{2}|com|ca|org|net|gov|mil|biz|info|mobi|name|aero|jobs|museum)$";
$email = "[email protected]";

print preg_match ($pattern, $email) ? "True" : "False";

Output:

Warning: preg_match() [function.preg-match]: No ending delimiter '^' found in /var/www/localhost/htdocs/scm/index.php on line 24
False

I changed the $ at the end to a ^ and it says false.

effigy · August 25, 2008

PREG requires delimiters. Do you need your pattern to be that specific?

jordanwb · August 25, 2008

Do you need your pattern to be that specific?

Well if it would detect valid emails apart then I don't see why not.

PREG requires delimiters.

So it doesn't matter what I use a delimiter as long as it's not already used in the pattern and it has to be at the beginning and the end.

So why doesn't that regex work since the delimiter isn't used anywhere else in the pattern?


<?php

$pattern = "^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.(?:[A-Z]{2}|com|ca|org|net|gov|mil|biz|info|mobi|name|aero|jobs|museum)^";
$email = "[email protected]";

print preg_match ($pattern, $email) ? "True" : "False";

?>

Output:

False

Could it be because of casing?

jordanwb · August 25, 2008

Could it be because of casing?

It is.

effigy · August 25, 2008

Your delimiter should typically be a character that is not a metacharacter, e.g., /pattern/. You also need the case insensitive modifier--/pattern/i--and to restore your beginning of line and end of line anchors--/^pattern$/i. I was asking about the specific domain names because (1) it's going to cause more work; and (2) that's not all of them.

jordanwb · August 25, 2008

Your delimiter should typically be a character that is not a metacharacter, e.g., /pattern/. You also need the case insensitive modifier--/pattern/i--and to restore your beginning of line and end of line anchors--/^pattern$/i.

Um ok. ???

I was asking about the specific domain names because (1) it's going to cause more work; and (2) that's not all of them.

Good point.

nrg_alpha · August 25, 2008

As effigy mentioned... you need delimiters.. this is one of the immediate (and perhaps confusing) differences between preg and ereg.. so.. your current pattern is:

$pattern = "^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.(?:[A-Z]{2}|com|ca|org|net|gov|mil|biz|info|mobi|name|aero|jobs|museum)$";

At the very begninning and very end of your pattern , you must add delimiters.. Perl Compatible Regular Expressions is very flexible with regards to this.. you can have anything you want (I think.. effigy can correct me on this if I am wrong).. so the default delimiters you see most often is '/' or either side.. but I personally like '#' (this is a drasw back for commenting inside patterns, but this is another story entirely). So using my style, examine what you have with this:

$pattern = "#^[A-Z0-9._%+-]+@[A-Z0-9.-]+\.(?:[A-Z]{2}|com|ca|org|net|gov|mil|biz|info|mobi|name|aero|jobs|museum)$#i"; the i outside the delimiter means upper or lowercase characters...

Notice the # characters, one at each end of this pattern? You can use '/' as delimiters if you like.. but understand that whatever character you choose as a delimiter, if that same character is needed as part of a match inside your pattern, you must escape it with a backslash '\'. so if I needed to use the # character as part of my pattern, I would need to do this:

$pattern = '# ...\# ... #'; // otherwise, without escaping, regex would confuse my # character inside as a closing delimiter...

Some 'safer' (read.. less likely to need escaping) alternative delimiters could be { ... } or % .... %. But no matter what.. you need delimiters.

jordanwb · August 25, 2008

Ok thanks.

nrg_alpha · August 25, 2008

I know this seems slightly confusing and frustrating at first.. but trust me.. after you do a few examples with preg (with delimiters of course ), you'll see it isn't that difficult to grasp.. the trickier part about regex in general is not the syntax, but rather the rules of engagment. Some characters inside a pattern can have their meanings changed (depending on where they are located.. that is to say, inside a character class or not).. but this all comes with simple practice and a little research on web tutorials delaing with PCRE.

Trust me.. once you start to grasp regex, you'll wonder how you managed without it.

jordanwb · August 25, 2008

Now I have to figure out a regex for a username with spaces.

ibechane · August 25, 2008

It's important to understand why using ^ as a delimiter won't work. If you're still confused about that, take a look at the first table in this regex cheat sheet:

http://regexlib.com/CheatSheet.aspx

The metacharacter (^) refers to the beginning of a string. The metacharacter ($) is the end string (remember.. from my first response?).

When you search a string with a regex like "abc," it will return a match within any of the following: "abc", "aabc", "aabcasdfjaskdjf". However, your original question stated that you wanted your string "Theme\.php" to be at the end of the string, that is, nothing should come after it. Changing your regex string to "Theme\.php$" will find a match in "ajaskdjfa;dji2348uTheme.php" but not "Theme.php1" because the character "1" does not match the regex $ metacharacter, which says to look to see if that is the end of the string.

You can think of delimiters as a set of parentheses that captures all of the regex string, except rather than actual parentheses, you can use any symbol that ISN'T a metacharacter (again, refer to the cheat sheet). The reason delimiters are used is that after the closing delimiter, you can put in flags like "i" or "g" to specify other options, like case-insensitivity and global searching.

jordanwb · August 25, 2008

So it would be like this:


if (preg_match('#Theme\.php$#', $file))
{
include ($file);
}

Sign In

[SOLVED] Detecting if a string ends with another

Recommended Posts

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Link to comment

Share on other sites

Archived

Important Information