Vielleicht Vielleicht Vielleicht
Revision as of 21:59, 18 January 2012 by Marie Wocher (talk | contribs)
A little script that extracts all sentences in a file that begin with a certain word, in this case "vielleicht".
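The idea described above can be sketched as follows. This is only a minimal illustration, not the script from this page: it is written for Python 3, the helper name `sentences_starting_with` and the sample text are made up for the example, and it assumes that a sentence ends at ".", "!" or "?" followed by whitespace, which is a simplification (abbreviations, for instance, would be split wrongly).

```python
import re


def sentences_starting_with(text, word):
    """Return the sentences in `text` whose first word is `word` (case-insensitive).

    Hypothetical helper for illustration only; sentence splitting is naive.
    """
    # Split after ., ! or ? when followed by whitespace (assumed sentence boundary).
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    # Match the word only at the very start of a sentence, as a whole word.
    starts = re.compile(r"^" + re.escape(word) + r"\b", flags=re.IGNORECASE)
    return [s for s in sentences if starts.match(s)]


# Made-up sample text; in practice this would be read from a UTF-8 file.
sample = (
    "Vielleicht regnet es morgen. Ich weiß nicht, ob er kommt. "
    "Vielleicht bleibt sie zu Hause! Er kommt vielleicht."
)

for sentence in sentences_starting_with(sample, "vielleicht"):
    print(sentence)
```

Only the first and third sentences of the sample are printed, because the last one contains "vielleicht" but does not begin with it.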
<source lang="python">
#!/usr/bin/env python
# -*- coding:utf-8 -*-
import codecs
# Read the whole input file as unicode text
text_file = codecs.open("file.txt", encoding="utf-8")
text = text_file.read()
import re
matches = re.findall(u"ich weiß nicht .+?[\.,]", text, flags = re.I)
for t in matches:
    print t.encode("utf-8"), "\n"
matches = re.finditer(u"ich weiß (.+?)[\.,]", text, flags = re.I)
for m in matches:
    # Encode explicitly, as in the first loop, so non-ASCII output does not fail
    print m.group(1).encode("utf-8")