Metagenomic data can be rich, but finding interesting patterns can be a challenge. What should we be looking for that's new and unexpected, and how do we find it?
14. Text Parsing
• Automate Oligos file creation
• Produce read count tables
• Parse richness and diversity results
• Parse error logs
• Split data into groups
18. Running the Workflow
1. Gather your data
2. Prepare an Oligos file
3. Zip everything up and copy to the “work” folder
4. Run mothur.sh
5. Come back in a few hours/days
6. Run ReadCountTable.py on the taxonomy output
7. Do additional downstream processing
8. Publish results
19. What we need help with
Data management
Post-hoc OTU naming
Improved scripting
Identifying new tools
Other stuff I haven’t thought of