Jump to content

.josh

Staff Alumni
  • Posts

    14,780
  • Joined

  • Last visited

  • Days Won

    43

Everything posted by .josh

  1. Yep, trim() is definitely the better route. However, to answer your question about preg_replace()... $str = ' --- ----- ------f-oo-oo----'; $str = preg_replace('/^(-|\s)+-(.*[a-zA-Z0-9])[^\-]-+$/' , '$2', $str); // This should be 'f-oo-oo' now - but it produced 'f-oo-o' echo $str; preg_replace() has to match your pattern as a whole in order to make a replacement. Your pattern says: Start at the beginning of the string, match one or more hyphens or "whitespace" characters (but only capture the first matched char), followed by a hyphen, followed by (and capture) 0 or more of any character followed by one number or letter (case-insensitive) (stop 2nd captured group). Then match one of anything that is not a hyphen, followed by one or more hyphens after that, followed by end of string. If you run that against a preg_match, you will see the following matches: Array ( [0] => Array ( [0] => --- ----- ------f-oo-oo---- ) [1] => Array ( [0] => - ) [2] => Array ( [0] => f-oo-o ) ) Your first piece ^(-|\s)+ matches this part: --- ----- ------f-oo-oo---- but your captured group (-|\s) only matches the last green - Then you have a literal "-" after that, which matches the final "-" before "f" (red part): --- ----- ------f-oo-oo---- Next, you have (.*[a-zA-Z0-9]) The .* will greedily match everything until the very end of the string, but then give up characters until it can match the rest of your pattern. Well the next thing in your pattern is your [a-zA-Z0-9] character class which matches a letter or number, so .* will match the blue part, and the character class will match that last "o" (orange part): --- ----- ------f-oo-oo---- But wait..then you have a negative character class that says match anything that is not a hyphen, so the .* has to back up one more time and give up the 2nd to last "o" as well, and then the [a-zA-Z0-9] can match that, and the [^\-] can match the last "o" (black): --- ----- ------f-oo-oo---- So the 2nd capture ($2 - the part with the 2nd parenthesis wrapped) in total is f-oo-oo Finally you have -+$ which matches one or more hyphens and then end of string, which matches the ending green: --- ----- ------f-oo-oo---- Soo...if you want to do it the regex way, a pattern more like this will work: $str = ' --- ----- ------f-oo-oo----'; $str = preg_replace('/^(-|\s)+|(-|\s)+$/' , '', $str); // This should be 'f-oo-oo' now - but it produced 'f-oo-o' echo $str; This pattern says: Start at beginning of string and match one or more hyphens or whitespace characters, OR match one or more hyphens or whitespace characters followed by end of string, and replace with "" (nothing).
  2. If nobody is answering your question, I would also advise that you step back and take a look at the question you asked. 9/10 times I see a question going unanswered not because people don't know the answer, but because the poster doesn't know how to properly ask a question. They tend to assume many things, short of the audience being psychic (and they expect even that sometimes!). Pretend you are someone else reading it and have no prior knowledge of the context for which you are working in. Did you provide enough back-story and detail? Did you provide enough relevant code? Did you clearly and accurately explain the situation, what you are doing, what is not working, what you expect? Etc.. This is especially important with Javascript, because there are literally a billion different things that could be causing something to go wrong. With php, there's only so many things that could be the culprit. The script is being run on one server and doesn't care about multiple browsers etc.. It is a single script running and the only other scripts that might be an issue are what you specifically include in php. But with Javascript...you have to worry about different browsers and browser versions. Even javascript syntax itself varies from browser to browser! You have to worry about random addons/extensions/plugins adversely affecting something. You have to worry about whatever other script includes are on the page, like frameworks, etc..
  3. Kinda sidetracking and nitpicking here...but if you are offering something that others don't offer, doesn't it go without saying that you are doing a better job than them at it? I mean, if you are the only one offering something, you are by default the one offering the best of it. One thing does not go hand in hand with the other, only because you're offering something which the others don't does not mean you're better off, people may not need, want, or even hate the unique feature you're offering. That special something you're offering must be a killer feature which will make people switch sides. Right. You can offer something that is complete crap, but that's not really the point. Whether you make $1 or $1,000,000 off it, you're still doing better than everybody else making $0 off it, because they are not doing it at all. This is true as well, however, this doesn't fall under the "Offer something that the others don't" category.
  4. Kinda sidetracking and nitpicking here...but if you are offering something that others don't offer, doesn't it go without saying that you are doing a better job than them at it? I mean, if you are the only one offering something, you are by default the one offering the best of it.
  5. Well, if you know your $eventDescLong variable is only ever gonna hold that one iframe, or more accurately, one place where you have a width=".." and height=".." then you can use the same patterns, though just work directly with $eventDescLong instead of using $video: $eventDescLong = preg_replace('~width\s?=\s?["\'][^"\']+["\']~i', 'width="580"', $eventDescLong); $eventDescLong = preg_replace('~height\s?=\s?["\'][^"\']+["\']~i', 'height="387"', $eventDescLong); But if you know you will have other things in $eventDescLong (like images) that will have a width=".." or height=".." then you can do this: $eventDescLong = preg_replace('~(<iframe[^>]*width\s?=\s?["\'])[^"\']+(["\'])~i', '${1}580${2}', $eventDescLong); $eventDescLong = preg_replace('~(<iframe[^>]*height\s?=\s?["\'])[^"\']+(["\'])~i', '${1}387${2}', $eventDescLong);
  6. $eventDescLong = '<iframe width="425" height="349" src="http://www.youtube.com/embed/oSxWqfayU9E" frameborder="0" allowfullscreen></iframe>'; $video = preg_replace('~width\s?=\s?["\'][^"\']+["\']~i', 'width="580"', $eventDescLong); $video = preg_replace('~height\s?=\s?["\'][^"\']+["\']~i', 'height="387"', $video);
  7. Cool story bro, but I meant for you to post an example of the actual value of $eventDescLong
  8. Post the content you are working with ($eventDescLong). Kinda hard to correct or point in the right direction when we don't know the context...
  9. at least you have someone there to guide you. My first day on the job has usually involved being hired because the previous 1-man-dev-crew left, so I got to walk in the door and figure out what was going on myself.
  10. $number = array_pop(explode(',',$string));
  11. gah, okay, I just ran it on a test table in my own phpmyadmin. Upon further reflection, since you are running from php and inside double quotes, you actually have to escape the escape too mysql_query("SELECT domain FROM domains WHERE domain REGEXP '^.{0,4}\\\.'");
  12. oh okay, you need to double escape: '^.{0,4}\\.'
  13. alternatively, you should be able to use $session->username if you have session running..and that is a valid object/property.. where are you trying to put session_start()? It should be at the very beginning of your ajax_more.php file, before any other output.
  14. url: "addons/functions/ajax_more.php?name=<?php echo $session->username; ?>", then use $_POST['name'] in ajax_more.php
  15. post your query string
  16. This won't work. First off, you need to wrap $1 in quotes in the replace argument. 2nd, this pattern will greedily match everything to the end of the string and then give up characters until it finds a ] so IOW it will actually match for the last occurrence of ] in the string. And then the $1 will replace everything it matched before that so IOW this pattern just removes the last occurring ] in the string preg_replace approach would be more like "~^([^\]]*).*~" Basically you have to use a pattern that matches the entire string. More efficient would be to use preg_match and just match the bit you actually want. More efficient would be to use other functions like strpos, substr, explode, etc..
  17. $string = array_shift(explode(']',$string));
  18. You said you wanted to test the string length before the first decimal. You then provided an example regex that showed 0-4 chars, and showed that even though "test.com" should match, presumably because even though " test.com" is 8 characters long, "test" is 4 characters. The pattern starts at the beginning of the string and matches 0-4 characters and then a literal dot. Yes, it will match for instance "test.com" or "test.someotherrandomstuff", because that is what you asked for. If this is not doing what you expect, then be more specific about what it is you are wanting to do and provide several examples.
  19. need to change (.*?) to (.*) Since .*? is a lazy match, and since it is the last thing in the string to match, it fulfills the pattern by matching nothing.
  20. "^.{0,4}\."
  21. I would also like to point out (from a more general perspective) that comparing your salary to others' around the world isn't exactly fair in many cases, as there is a huge cost-of-living difference between many countries. For instance, I hear lots of people around the world live off of a few american dollars a day...well, living off a few dollars a day isn't really possible in america. Your strongest asset is your low cost-of-living. Focus on being able to communicate with others around the world and you will always have the advantage, because you can afford to work for cheaper.
  22. I guess one way to do it: $range = array_merge(range(0,89),range(181,360)); $number = $range[array_rand($range)]; echo $number;
  23. couple things to note about the pattern AbraCadaver provided: preg_match_all('/id="r4_([\d]+)">/', $string, $matches); \d doesn't have to be wrapped in a character class, it's kind of it's own character class, so you can just do \d+ But \d technically matches more than numbers, so if you want to be more specific, use [0-9] instead: preg_match_all('/id="r4_([0-9]+)">/', $string, $matches);
  24. If you are looking for an easy fix... Your mistake is the .* here : Change it to .*? If you want to actually learn something, read this tl;dr.... * is a quantifier. Quantifiers are by default greedy (versus lazy). The difference between greedy and lazy matching is this: A greedy quantifier will match everything it can possibly match, and then start giving back stuff if it has to, to satisfy whatever comes after it in your pattern. A lazy quantifier will creep forward and only match until the first time it hits something that matches the rest of your pattern. Here is an example. I have a string, and I want to match everything up to "O". $string = "XXXXOXXXXXOXXXXXO"; preg_match("/.*O/",$string,$greedy); preg_match("/.*?O/",$string,$lazy); Pattern 1 (greedy) : "/.*O/" With this pattern, what happens is .* will first match everything it can match. the dot is a "match all" character that match (almost) everything. So first it matches literally the whole string: "XXXXOXXXXXOXXXXXO" But then the pattern ends with an O so it has to give up characters until it satisfies that part of the pattern. So it gives up characters 1 at a time until it finds the first O: "XXXXOXXXXXOXXXXXO" And then the O in your pattern matches the last O in the string and $greedy will return the full string. Pattern 2 (lazy) : "/.*?O/" So in this pattern, a ? is thrown into the mix. The .* part of the pattern still matches (almost) everything, but the ? tells it to be a lazy match. With this pattern, what happens is .*? Will match only up to the point where the rest of the pattern can be fulfilled. So instead of consuming everything and then moving backwards to give up stuff until the rest of the pattern can be fulfilled, it moves forward until it finds the first instance in which the rest of the pattern can be fulfilled: "XXXXOXXXXXOXXXXXO" So the overall takeaway here is that greedy quantifiers will consume everything it can and then give up only what it has to, to satisfy the rest of the pattern. If the pattern can be fulfilled in more than one place in your string, it will be fulfilled from the last instance in the string (based on what the greedy quantifier actually matches). Whereas lazy quantifiers will move forward and consume only to the point where the first instance of the rest of the pattern occurs. So for your pattern, you should be doing this: function clb($matches) { var_dump($matches); } preg_replace_callback('/<line +exp *= *"([a-zA-Z0-9\.\-\ ]*)">.*?\<\/line\>/siu', "clb", $str); However, a couple notes here... First off, you say you want it to display the following: array(4) { [0]=> string(7) "content" [1]=> string(9) "something" [2]=> string( "content2" [3]=> string(10) "something2" } This isn't going to happen with preg_replace_callback(). What you will actually see is two separate calls to clb() and two var_dumps, not one single var_dump: one for each matched pattern. If you are looking to just match something (versus replacing it with something else), you should be using preg_match (if you are wanting to just match the first occurrence) or preg_match_all (if you do indeed want to grab all occurrences). Another thing to mention is your pattern is a bit bloated and inefficient. So you have this: '/<line +exp *= *"([a-zA-Z0-9\.\-\ ]*)">.*?\<\/line\>/siu' First off, you have a lot of unnecessary escaping going on. You only need to escape characters that mean something special to the regex engine. So you don't need to escape < and you don't need to escape . or - or the space in the character class (well, maybe the hyphen, with the way you have it positioned in there, but you can move it to the front or the back of the list to avoid having to escape it). Also, you can avoid escaping the / by using a different delimiter for your pattern. You can use pretty much any non-alphanumeric character as your pattern delimiter, so it's usually a good idea to use something other than a / if you are trying to parse html, because / creeps up in html a lot, and therefore you have to escape it and make your pattern more bloated and harder to read. So your pattern can be cleaned up as so: '~<line +exp *= *"([-a-zA-Z0-93. ]*)">.*?</line>~siu' But there's more that can be done. So back to greedy and lazy quantifiers... all fine and dandy, except .*? isn't really efficient. In order for it to work, it still has to look ahead and find where the rest of the pattern can be satisfied. This is extra work for the regex engine. The ideal thing is to have the quality of the greedy quantifier where it can just match everything it's supposed to, but not have to turn around and backtrack to give stuff up. So there are two ways we can do that: with a negative character class, or with a negative look-ahead. negative character class You already have a character class in your pattern: [-a-zA-Z0-93. ] This is a positive character class, which tells the regex engine to match any one of those characters (literal characters or within the ranges, like a-z), and your * quantifier after that says match 0 or more of that character class. A negative character class tells the regex engine to match anything not specified. A negative character class starts with ^ as the first character inside the square brackets. So, going back to my XO string example: $string = "XXXXOXXXXXOXXXXXO"; preg_match("/.*O/",$string,$greedy); preg_match("/.*?O/",$string,$lazy); preg_match("/[^O]*O/",$string,$ncc); matches: .* : "XXXXOXXXXXOXXXXXO" .*? : "XXXXOXXXXXOXXXXXO" [^O]* : "XXXXOXXXXXOXXXXXO" So here you see that the negative character class will essentially match the same thing as the .*? lazy quantifier. But the difference is that the negative character class doesn't have to look ahead and try to match the rest of the pattern. It just looks at the next character and says "are you in this character class? No? Okay matched. Yes? Okay stop matching." So how can negative character class be applied to your pattern? Well, you could use [^<]* and it will match everything up until the next < but that assumes that the content between your tags won't ever have a < in it, and that the next < will be from </line>. If you know this is the case, then that is what you should use, as it will be more efficient than .*? for sure, but also more efficient than a negative look-ahead. But if that is not the case... the next best option would be a negative look-ahead. Negative Look-ahead Okay so the problem with a negative character class is that it boils down to matching a single character (though it can match anything specified in the class, it still only matches one character at a time). But (not) matching for a single character isn't really enough in some cases, especially when it comes to matching a delimiter that is more than one character (like closing html tags). Look-ahead gives you the ability to look ahead to more than one character at a time. It looks like this: (?!) And you put what you want to look for between the ! and ) so for instance (?!</line) So (?!</line>) just looks ahead to see if that is what is next in your string, but it doesn't actually match anything (the official term for this is "zero width assertion"). You still need to use something to actually match for the content. Since we want to match anything that is not </line> we can use a dot. ((?!</line>).) The outer (...) is to capture what you are matching. So basically this first looks ahead to see if "</line>" is not there, and then matches the next thing, which is (almost) any one character. But we want to match more than one character. Just like with .*? we want to match everything up to </line>. This is where it gets kind of tricky... We could try... ((?!</line>).*?) But this won't actually work. What really happens here is that it tries to match everything in your string (lazy) - .*? - not followed by </line>. Since this is a lazy match, it moves forward, looking for </line> and since it occurs later on in your string as a whole, the .*? will not match anything. If we were to change it to this (make it greedy): ((?!</line>).*) ...it would match something..but not what you want! It will greedily consume your entire string and since there is nothing after the end of your string, the negative look-ahead still works because </line> is not matched. I know, that's probably the most confusing part of this, and has tripped me up for the longest time. The solution here is to use that negative look-ahead to look ahead and then match one character at a time: ((??!</line>).)*) So first we have the core (?!</line>) which is the negative look-ahead. Then we have (?:(?!</line>).) which makes this as a group, but does not capture it. Then we have the * to quantify it, match 0 or more of that group, and wrap all that in (..) to actually capture it. So now it will look at things one character at a time and see if the character is part of the </line>. If not, add it to the group to capture. example string: "foo<bar</line>" Is 'f' part of </line>? Nope, put it into the captured group. Is 'o' part of </line>? Nope, put it into the captured group. Is 'o' part of </line>? Nope, put it into the captured group. Is '<' part of </line>? Yes! It's the same as the starting character of </line> BUT overall, the next bits of the string will not match </line> because the next character after that is a b not a /, so it does not match, put it into the captured group. Is 'b' part of </line>? Nope, put it into the captured group. Is 'a' part of </line>? Nope, put it into the captured group. Is 'r' part of </line>? Nope, put it into the captured group. Is '<' part of </line>? Yes! and the following characters will all match up to </line>, so there is nothing else to capture. So overall, we now have: '~<line +exp *= *"([-a-zA-Z0-93. ]*)">((??!</line>).)*)</line>~isu' So overall, the regex engine is still having to look head in the string and see if something is there, so it's not quite as efficient as a negative character class, but in many cases, a negative character class isn't really an option, and this is still more efficient than having to look ahead at the entire rest of the pattern when you use a lazy match all .*? But we can still make this a bit more streamlined. You have that positive character class to match and capture the value of the exp attribute. Unless you are specifically wanting to match values that only have those characters (like for instance, you don't want to match if exp="foo_bar"), you can make this more efficient by applying a negative character class instead, by just matching anything that is not a closing quote. And what about if single quotes are used instead of double (single quote is escaped because that's the quote you are using to wrap this whole pattern for your function call)? '~<line +exp *= *[\'"]([^\'"]*)[\'"]>((??!</line>).)*)</line>~isu' Also I see that you attempt to quantify the spaces in there, which means you're trying to match in case there is a difference in spacing. But what if there are other attributes in that opening line tag? Your pattern currently only matches if exp is the first (and only) attribute. Also, it doesn't consider any errant spacing that might occur between the " and >. Also, most people use either 0 or 1 spaces between equal signs and attributes, so you can probably safely use a ? instead of * which is technically more efficient. But if you want to cover just-in-cases, you may as well just use \s instead of a space because it is easier to read \s than " ". \s will match any "white space" character (like a space from spacebar, or a tab, etc..), so technically it will match more than just a spacebar space but it's still whitespace and \s is easier to see in a pattern than a blank space. So overall, a more flexible, efficient pattern would be: '~<line[^>]*exp\s*=\s*[\'"]([^\'"]*)[\'"][^>]*>((??!</line>).)*)</line>~isu' So...using your example content... <?php $str = <<<CONTENT <line exp="something"> content </line> <line exp="something2"> content2 </line> CONTENT; function clb($matches) { var_dump($matches); echo "\n"; } preg_replace_callback('~<line[^>]*exp\s*=\s*[\'"]([^\'"]*)[\'"][^>]*>((??!</line>).)*)</line>~isu', "clb", $str); ?> outputs (from rightclick > view source): array(3) { [0]=> string(38) "<line exp="something"> content </line>" [1]=> string(9) "something" [2]=> string(9) " content " } array(3) { [0]=> string(40) "<line exp="something2"> content2 </line>" [1]=> string(10) "something2" [2]=> string(10) " content2 " } But again I have to question whether or not you should be using preg_replace_callback vs. preg_match or preg_match_all.. maybe you are just var_dumping to test the match, and you really do intend to replace something, but if you are just looking to match things and use those matches somewhere else and do not intend to actually change $str, you should be using preg_match or preg_match_all instead.
  25. .josh

    preg_match

    "/^[-a-z0-9_ ]+$/i"
×
×
  • Create New...

Important Information

We have placed cookies on your device to help make this website better. You can adjust your cookie settings, otherwise we'll assume you're okay to continue.