Automatic processing of text data

NPFL098 / ATKL00345

Pavel Straňák

úterý 10.40–13.50
Malostranské nám. 25, SU1

2. 5. 2017

Warm-up Unix

Suspending a running process

find -criterion 'value' [-exec ;]


process substitution (almost like a temporary file)

diff file1 <(cat xy | sort | uniq -c | sort -nr)


start_btred -l <(ls *.t.gz)  -qTe \
'map {print join(", ", ListV($_->{"tnode.rfs"})), "\n" } ListV($root->{mwes})'

Version Control Systems


Most popular Distributed VCS.

Very good documentation is in this nice book:

VCS + Project management

Some VCS, often support for more choices (SVN, Git, Mercurial, Bazaar), Issue tracker, wiki, sometimes other components. Linking between components: refering to code changes (commits) wrom wiki or issues, linking resolved issues from commit messages, etc.

Github Best Practices

Not really just Github. One possible workflow. Makes sense for all big projects.