Coordinator for the German Human Genome-Phenome Archive Bavarian Data Hub

Updated: 40 minutes ago

Building a Bavarian-wide secure compute environment for genomic data within GHGA

05.03.2024, Wissenschaftliches Personal

Are you passionate about the intersection of cloud computing, IT security, big data, and biomedical research? The German Human Genome-Phenome Archive (GHGA) is seeking a talented and driven individual to join our dynamic team. As the group IT specialist with a background in data science and bioinformatics, you will play a pivotal role in shaping the future of biomedical data storage, analysis, and dissemination in Bavaria and beyond using state-of-the-art cloud technologies and workflows. Join us in revolutionizing the field and impacting genomic medicine research!


Who we are

The German Human Genome-Phenome Archive (GHGA ) consortium is a national secure and trustworthy network of data centers for archiving, processing, and serving human genome sequencing and other omics data for secondary use in biomedical research. It is part of the National Research Data Infrastructure (NFDI) initiative and is the German node for the federated European Genome-Phenome Archive (fEGA). The Munich-based GHGA team is hosted by the chair of Computational Molecular Medicine (CMM ), which is leading the next-generation sequencing workflow and data hub development. The chair is located in the informatics department of the Technical University of Munich, one of the top-ranked European universities. The team is co-hosted by the Institute of Human Genetics at the Klinikum rechts der Isar. It has strong links to other local scientists and institutions in biology and medicine but also close connections to private cloud providers.


Job description

We are looking for an experienced IT specialist with a strong background in bioinformatics and data stewardship to oversee the implementation and management of the Bavarian Data Hub of the GHGA. As a GHGA Data Hub Coordinator, you will organize, secure, and manage the underlying IT Infrastructure as well as the genomic data within and between the GHGA data hubs. You will be responsible for overseeing and implementing the flow of data, ensuring its accuracy, integrity, and accessibility. Your primary duties will be ensuring that the genomic data is safe and secure within a cloud environment and deploying GHGA-defined microservices using Terraform and container orchestration tools. In addition, you will collaborate with various GHGA workstreams, such as Architecture, Workflows, and the ethics and legal team, to understand their data and policy needs, implement data governance policies, maintain data quality standards, and optimize data workflows. You will also work with the local data protection officers and legal and IT teams to align data protection, legal, and contractual requirements and implement an IT Security Management System (ISMS). The data hub will be deployed and run within a private cloud provider environment, enabling scalable, safe, and reproducible on-demand data processing and archiving. The successful candidate will lead the local data and cloud management team and form part of the nationwide GHGA consortium, participating in the Operations Committee meetings and decision-making processes.


Your responsibilities
  • Deploy a secure cloud environment to archive and process genomic data at scale, building on private cloud providers.
  • Establish and maintain data access controls and permissions to ensure data security and confidentiality.
  • Deploy and secure microservices using Terraform on a private cloud provider such as GCP or AWS.
  • Coordinate data collection, storage, and dissemination processes within the data hub.
  • Implement an IT Security Management System (ISMS)  in collaboration with the local DPO and IT team.
  • Serve as a liaison between technical and non-technical workstreams, facilitating communication and collaboration on data-related initiatives.
  • Work with different workstreams to understand their data and legal requirements and ensure data is collected and stored appropriately.
  • Implement and enforce data governance policies and procedures to maintain data quality, security, and GDPR compliance.
  • Develop and maintain documentation related to microservices, data sources, schemas, and processes.
  • Monitor the IT infrastructure and data quality and integrity, identifying and resolving issues as they arise.

Requirements
  • PhD or postdoc experience in computer science, data management, information systems, bioinformatics, or related field.
  • Strong understanding of data management principles, practices, and technologies.
  • Familiarity with data governance frameworks, such as GDPR, HIPAA, or similar regulations.
  • Proficiency in working in a UNIX-based ecosystem (partially in systems administration and hardening).
  • Experience in cloud computing infrastructure and platforms (OpenStack, AWS, Google Cloud).
  • Experience with containerized package and application deployment systems (Docker, Terraform, etc.)
  • Proficiency in at least one programming language (Python, Java, R, Go, …).
  • Excellent English communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams.
  • Ability to work effectively in a fast-paced, dynamic environment with cross-functional teams.

Desirable but not required
  • Experience in project management and leading multidisciplinary teams.
  • Experience in data management tools and technologies, such as SQL databases, data warehouses, ETL tools, and data visualization platforms.
  • Certification in IT security, cloud computing, data management, or related fields is desirable.
  • Experience in developing and deploying complex software systems.

We offer
  • Opportunity to be at the forefront of advancing biomedical research through innovative data science and bioinformatics approaches.
  • International, diverse, LGBTQ-friendly, and interdisciplinary working environment.
  • Working with world-leading IT partners such as private cloud providers and the Leibnitz Rechenzentrum (LRZ).
  • Salary according to TV-L up to level E14, including social benefits.
  • Flexible working hours and home-office policy.
  • Possibilities for further scientific qualification and personal development.
  • Disabled applicants with equal suitability and qualifications will be given particular consideration.
  • The TUM strives to increase the proportion of women, so applications from women are therefore expressly welcomed.

Application

The full-time position is funded until the end of the first GHGA funding period (30.09.2025) with a salary according to the TV-L (German academic salary scale) up to level E14 with a probable extension of another 5 years. The position is available immediately.

Applications should include a cover letter, CV, and references and must be sent to [email protected] by 31 March 2024, with “GHGA Data Hub Coordinator” in the subject.


More Information

http://ghga.de/
https://www.cs.cit.tum.de/cmm/
https://www.mri.tum.de/humangenetik
https://tum.de

Prof. Dr. Julien Gagneur
Computational Molecular Medicine
Technical University Munich
Boltzmannstraße 3
85748 Garching bei München Germany


The position is suitable for disabled persons. Disabled applicants will be given preference in case of generally equivalent suitability, aptitude and professional performance.


Data Protection Information:
When you apply for a position with the Technical University of Munich (TUM), you are submitting personal information. With regard to personal information, please take note of the Datenschutzhinweise gemäß Art. 13 Datenschutz-Grundverordnung (DSGVO) zur Erhebung und Verarbeitung von personenbezogenen Daten im Rahmen Ihrer Bewerbung. (data protection information on collecting and processing personal data contained in your application in accordance with Art. 13 of the General Data Protection Regulation (GDPR)). By submitting your application, you confirm that you have acknowledged the above data protection information of TUM.

Kontakt: [email protected]


More Information

Similar Positions