jokerbla Posted July 7, 2010

Hi, I'm writing a script that extracts unique words (each word only once) from multiple text files, all in one folder. I've used regular expressions before, but I've forgotten the syntax for the pattern. So far, for extracting from a single file, I have:

<?php
$file = file_get_contents('1.txt');
$pattern = '???';
$sp = preg_split('/ /', preg_replace($pattern, ' ', $file));
$sp = array_unique($sp);

What do I write for the pattern? I need to replace every character that is not a letter (such as : , " - ' . ( ) ) with a space, so that I can later extract everything except the spaces and write it to another text file.
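The thread never spells out the pattern for this first question, so here is one possible sketch rather than the poster's eventual solution. It assumes plain ASCII letters only (for accented letters, something like '/[^\p{L}]+/u' may be closer to what is needed). The helper name extractUniqueWords(), the sample text, and the choice to lowercase words are all assumptions made for illustration.

```php
<?php
// Sketch only: extractUniqueWords() and the sample text are hypothetical,
// not part of the original script.
function extractUniqueWords(string $text): array
{
    // Replace every run of non-letter characters with a single space.
    $clean = preg_replace('/[^a-zA-Z]+/', ' ', $text);
    // Split on spaces; PREG_SPLIT_NO_EMPTY drops empty strings at the edges.
    $words = preg_split('/ +/', $clean, -1, PREG_SPLIT_NO_EMPTY);
    // Lowercase so "The" and "the" count as one word (an assumed design choice).
    $words = array_map('strtolower', $words);
    // array_unique keeps first occurrences; array_values reindexes from 0.
    return array_values(array_unique($words));
}

$sample = "The cat: the \"dog\" - and (the) bird.";
print_r(extractUniqueWords($sample));
```

For a whole folder, the same function could be applied to the contents of each file and the results merged before a final array_unique.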
jokerbla (Author) Posted July 7, 2010

I've sort of figured the above out, but I have another question. How do I replace a character, let's say "-", when it is followed by a line break or similar whitespace such as \n or \t? I tried:

$file = preg_replace('/ [\'-\r\']|[\'-\n\'] /', '', $file);

This doesn't seem to work.
ZachMEdwards Posted July 11, 2010

You'd use a pattern like:

'/-[\r\n\t]/'
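As a rough sketch of how that pattern might be dropped into preg_replace: the sample string below is made up, and the trailing + is an addition to the suggested pattern, not part of the reply. Without it, a Windows-style "-\r\n" would lose only the "-\r" and leave the "\n" behind.

```php
<?php
// Sketch only: the sample string is invented for illustration.
$file = "extrac-\ntion of hyphen-\r\nated words, tab-\tseparated";

// Match '-' followed by one or more of \r, \n, \t and remove the whole match.
// The pattern is single-quoted, so PCRE (not PHP) interprets the escapes.
$file = preg_replace('/-[\r\n\t]+/', '', $file);

echo $file, PHP_EOL; // prints: extraction of hyphenated words, tabseparated
```

Note that the character class needs no quotes inside it; the original attempt's \' characters and surrounding spaces were being matched literally.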
This topic is now archived and is closed to further replies.