Automatic processing of text data


NPFL098 / ATKL00345

Pavel Straňák

stranak@ufal.mff.cuni.cz

úterý 10.40–13.50
Malostranské nám. 25, SU1

2. 5. 2017

Warm-up Unix

Suspending a running process

find -criterion 'value' [-exec ;]

xargs

process substitution (almost like a temporary file)

diff file1 <(cat xy | sort | uniq -c | sort -nr)

or

start_btred -l <(ls *.t.gz)  -qTe \
'map {print join(", ", ListV($_->{"tnode.rfs"})), "\n" } ListV($root->{mwes})'

Version Control Systems

Git

Most popular Distributed VCS.

Very good documentation is in this nice book: https://git-scm.com/book/

VCS + Project management

Some VCS, often support for more choices (SVN, Git, Mercurial, Bazaar), Issue tracker, wiki, sometimes other components. Linking between components: refering to code changes (commits) wrom wiki or issues, linking resolved issues from commit messages, etc.

Github Best Practices

Not really just Github. One possible workflow. Makes sense for all big projects.