Hello Regexperts,
I pulled a regex from the internet to separate words that start with capital letters.
Here is the formula:
TEXTJOIN(" ",,REGEXEXTRACT(R158,REGEXREPLACE(R158,"([A-Z][a-z]+)","($1)")))
When I have this:
HummelfilmTamtam FilmThe Imaginarium Films
the regex works and returns this:
Hummelfilm Tamtam Film The Imaginarium Films
But when I have this:
Nordisk Film ProductionNadcon FilmZweites Deutsches Fernsehen (ZDF)
or this:
Zentropa International SwedenCanal+Ciné+
the regex breaks.
It looks like the presence of non-alphanumeric characters is causing the problem. Can anyone please help me rewrite the regex so that it also works with non-alphanumeric characters?
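
To narrow it down, here is a rough Python sketch of what I think the formula is doing. It is only an approximation: Python's `re` module is not the same engine Google Sheets uses, and the function names `build_pattern` and `split_words` are made up for illustration. The inner REGEXREPLACE turns the cell text itself into a pattern, so anything it does not wrap (parentheses, `+`, accented letters, all-caps words like ZDF) is left in the pattern as raw regex syntax.

```python
import re


def build_pattern(text: str) -> str:
    # Approximates REGEXREPLACE(text, "([A-Z][a-z]+)", "($1)"):
    # wraps every capitalised word in a capture group.
    return re.sub(r"([A-Z][a-z]+)", r"(\1)", text)


def split_words(text: str):
    # Approximates TEXTJOIN(" ", , REGEXEXTRACT(text, pattern)):
    # returns the captured words joined by spaces, or None where
    # the spreadsheet formula would fail to match.
    match = re.search(build_pattern(text), text)
    return " ".join(match.groups()) if match else None


print(split_words("HummelfilmTamtam FilmThe Imaginarium Films"))
print(build_pattern("Zentropa International SwedenCanal+Ciné+"))
print(split_words("Zentropa International SwedenCanal+Ciné+"))
```

In the third example the generated pattern ends in `(Canal)+(Cin)é+`, where each `+` is read as a quantifier rather than a literal plus sign, so the pattern no longer matches the original text. In the ZDF example the leftover `(ZDF)` is read as a capture group instead of literal parentheses, which seems to fail for the same reason.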