Reproduciable Science

Lecture notes

Reproduciable Science (pdf)

Exercise

Please work in groups of two. During the exercise fill your comments directly in the GoogleDoc file using the link here.

1. Markdown

Open Haroopad/MacDown. Explore the markdown editor by yourself. Try to generate titles of different sizes, add plain text and code and save the markdown files as pdf. Use either the features in the insert menu or the cheat sheet.

2. RegEx

Download the table from the course website here and open it with the Atom editor. Go to the find and replace section and activate the regex mode.

Remove the ending (_L*) from each fasta header. The cheat sheet might be helpful.

Solution

>([A-Z,0-9]+)_L[0-9]+

>$1

3. Data manipulation

Download the table from the course website here. It is a subset of the sex of life database. For each plant species the sexual system is provided.

The goal is to reformat the table from the version you have into the format below.

Genus Species Sexual_system
Silene acutifolia hermaphrodite
Silene adenocalyx hermaphrodite
Silene aegaea hermaphrodite
Silene aegyptiaca hermaphrodite
Silene aellenii hermaphrodite
Silene almolae hermaphrodite
Silene alpestris hermaphrodite
Silene ammophila NA
Silene amoena NA
Silene acaulis non_hermaphrodite
Silene dioica non_hermaphrodite
Silene latifolia non_hermaphrodite
Silene vulgaris non_hermaphrodite

(1) Write a short log file to describe what you did using your favorite editor or excel.

In the next exercise we going to use R to do the manipulation.

Additional information