Bioinformatics
The Bioinformatics platform provides support for the analysis of various NGS workflows including single cell analysis, access to data analysis pipelines and bioinformatics training courses. Our service is open to internal and external customers on a fee for service basis. Our team strives to closely follow the latest scientific and technical developments so we can test and integrate appropriate end-to-end computational workflows for our customers.
To that end, we
- Conduct the primary processing and data management for the NGS platform
- Develop and provide state-of-the-art transferrable, reusable and reproducible bioinformatics analysis pipelines for various next generation sequencing (NGS) applications as a service (visit Computational Pipelines for details)
- Provide training, using interactive platforms to enable users to perform follow-up analytics and visualizations without the need for own large-scale bioinformatics infrastructure and resources (visit Training and support for details)
- Provide consulting, customized analysis, mentoring of students and knowledge transfer projects (visit Platform services for details).
The Bioinformatics platform provides support for the analysis of various NGS workflows including single cell analysis, access to data analysis pipelines and bioinformatics training courses. Our service is open to internal and external customers on a fee for service basis. Our team strives to closely follow the latest scientific and technical developments so we can test and integrate appropriate end-to-end computational workflows for our customers.
To that end, we
- Conduct the primary processing and data management for the NGS platform
- Develop and provide state-of-the-art transferrable, reusable and reproducible bioinformatics analysis pipelines for various next generation sequencing (NGS) applications as a service (visit Computational Pipelines for details)
- Provide training, using interactive platforms to enable users to perform follow-up analytics and visualizations without the need for own large-scale bioinformatics infrastructure and resources (visit Training and support for details)
- Provide consulting, customized analysis, mentoring of students and knowledge transfer projects (visit Platform services for details).
Our Platform Services
Primary data processing transforms the raw data generated by the sequencing instruments into quality-scored nucleotide sequences (FASTQ format), a requirement for further processing of the data. Additionally, a QC report and checksum files to verify the file integrity of transferred data are generated. This step is an integral part when the sequencing service of the NGS platform at HMGU is used.
User fees & service usage: Included in sequencing service of NGS platform.
We offer basic processing of single cell and bulk sequencing data for different platforms and technologies including re-processing of published/public datasets. The computational pipelines perform all necessary steps (e.g. alignment, peak calling, feature counting etc.) required for tertiary analyses and generate a QC report. Please see Computational Pipelines for details.
User fees: Per project depending on the number of samples processed.
Customized analysis is tailored to an individual project and can only be done in close collaboration with the customer. At the beginning of each custom analysis we will schedule a mandatory consultation session to discuss the project in detail as well as possible analysis strategies, milestones and expected results. After accepting the written agreement and provision of a valid payment source we will schedule the work. Customers can expect updates in interim meetings and will also obtain interim reports and a final report with the results. Upon completion of the project the users will receive all data (pipelines, processed data, analysis code) that were generated for the project.
User fees: Scope and costs will be defined in the proposal and listed in an offer which needs to be accepted by the user.
Together with our colleagues from CF Statistical consulting we regularly run a Data Analysis cafe. In these sessions we offer consulting e.g around experimental design (prior to start of a project), analysis and computational implementation questions with our users. For more regular consultation we offer Mentoring (see separate tab).
This is an extension of a single consultation meeting for those that need more regular supervision while doing the analysis themselves. A platform staff member meets regularly (e.g. once a week for 1h) for with the user. A short proposal will be created with the contents of the mentoring.
User fees: Initial consulting (1h) is free for HMGU members, afterwards billed hourly.
Helholtz Munich scientist wishing more in depth learning in bioinformatics and NGS-analysis will be embedded into the Bioinformatics team for a limited time (e.g. 3-6 Monate). Scientists will obtain continuous support from core team members for their project, but will also contribute to other projects in the platform. Prerequisits for the service are basic skills in R, Linux-opeatring systems (+ Bash-Scripting) und ideally Python.
User fee: monthly rate, please contact us for more detail.
Please note: This service can only be made available when core workload allows.
Our mission is to provide state-of-the-art transferrable, reusable and reproducible bioinformatics analysis pipelines for various next generation sequencing (NGS) applications. The platform wants to always keep up and offer service for the latest research developments. Therefore, we collaborate with PhD students and Postdocs who work on genomics projects and develop new computational pipelines and methods. Researchers profit from the platform's knowledge base and learn best practices on how to develop reusable and transferable workflows. Once developed, the workflows are then offered by the platform as a service to its user base.
User fees and service usage: free for HMGU members, please contact us for capacity and details.
Galaxy pipelines
FFor users with no prior experience in bioinformatics and just starting with the analysis of NGS data we recommend taking our courses where we provide basic and advanced training (see Training Courses). Our courses will train you on how to use computational pipelines using Galaxy (https://galaxyproject.org).
Advanced computational pipelines
For the following NGS technologies, computational pipelines were established and are available for platform users and projects. These pipelines are developed using workflow frameworks such as Nextflow [1] or Snakemake [2] and are executed on the compute cluster. The usage is best suited for trained users with bioinformatic background and knowledge of Linux and command line usage.
We offer the processing of your data with the computational pipelines through our Platform services that can be booked via iLabs (Secondary data processing).
Further information for HMGU users regarding the usage of the pipelines is available from here.
Available computational pipelines:
Bulk sequencing:
- RNA-Seq & miRNA-Seq
- ChIP-Seq
- ATAC-Seq
- WGS variant calling
Single cell sequencing:
- 10x-RNA-Seq
- 10x-ATAC-Seq
- 10x-Visum (Spatial Transcriptomics)
- 10x-VDJ (Immune Profiling)
- 10x-Feature-Barcoding
- 10x-Multiome (ATAC-seq + RNA-Seq)
- Drop-Seq
- Smart-Seq
- Single cell variant calling
- Genetic demultiplexing of single cell data
References
[1] https://www.nextflow.io
[2] https://snakemake.readthedocs.io/en/stable/
We offer a wide variety of high-quality statistical courses at different levels of expertise to Helmholtz staff and, upon request, to external institutes.
We currently offer five different courses covering the most important aspects of bioinformatic analysis of sequencing data. The course "Introduction to Bioinformatics for NGS data" is aimed at beginners in Next Generation Sequencing (NGS) projects. In addition to the general aspects of experimental design and sequencing techniques, this course gives an overview of the analysis of NGS data.
Our courses for RNA-Seq and ChIP-Seq aim to cover the full analysis process end-to-end from reads to results. For our courses we use the publicly available open-source web based Galaxy platform. Galaxy provides a wealth of tools and computational resources for processing data from various omics technologies and allows to build automated workflows.
Our course “NGS downstream analysis” focuses on the bioinformatic analysis of pre-processed NGS data (e.g., RNA-Seq or ChIP-Seq data). The main topics of this course are statistical testing, visualization of the results and Gene Set Enrichment Analysis.
For single-cell sequencing projects, we offer the “Single-cell transcriptomics best practices” course, which covers the most important analysis steps and gives an overview of recommended tools for each step.
All courses are held as online courses via the Teams app, except the ChIP-Seq course, which can be completed on a self-study basis. Please use the Campus Management System CaMS to see the dates of our courses and for registration. To get deteiled information about course of your interest, please click on the particilar button below.
Introduction to Bioinformatics for NGS data
Differential gene expression analysis using RNA Seq data and the Galaxy platform
Differential protein binding to DNA using ChIP-Seq data and the Galaxy platform