Originally Posted by chaley
The problem is that the regex requires a space before and after the split indicator. My guess is that in the metadata there is a space after the semicolon but not before.
Try this:
Code:
authors_split_regex = '(?i),?(\\s+(and|with|und|mit|)\\s+|(;|•))'
It still requires spaces before and after the words but doesn't require spaces before or after the semicolon or the •.
|