Bioinformatics Data Skills: Reproducible and Robust Research with Open Source Tools

This practical book teaches the skills that scientists need for turning large sequencing datasets into reproducible and robust biological findings. Many biologists begin their bioinformatics training by learning scripting languages like Python and R alongside the Unix command line. But there's a huge gap between knowing a few programming languages and being prepared to analyze large amounts of biological data. Rather than teach bioinformatics as a set of workflows that are likely to change with this rapidly evolving field, this book demonstrates the practice of bioinformatics through data skills. Rigorous assessment of data quality and of the effectiveness of tools is the foundation of reproducible and robust bioinformatics analysis. Through open source and freely available tools, you'll learn not only how to do bioinformatics, but how to approach problems as a bioinformatician.

• Go from handling small problems with messy scripts to tackling large problems with clever methods and tools
• Focus on high-throughput (or "next generation") sequencing data
• Learn data analysis with modern methods, rather than covering older theoretical concepts
• Understand how to choose and implement the best tool for the job
• Delve into methods that lead to easier, more reproducible, and robust bioinformatics analysis

Author(s): Vince Buffalo
Edition: 1
Publisher: O’Reilly Media
Year: 2015

Language: English
Commentary: True PDF
Pages: 538
City: Sebastopol, CA
Tags: Command Line; Unix; Bash; Data Analysis; Bioinformatics; Pipelines; R; Best Practices; SQLite; Genomics; SSH; Automation; Markdown; Git; Version Control; Shell Scripting; Reproducible Research; Pysam; Python