Data Engineer

Updated: 2 months ago
Location: New York City, New York
Job Type: Full-Time
7524- NEU Neurology  
Medical Center  
Officer Full-Time Regular  
Research Support (Laboratory and Non-Laboratory)  
We are looking for an enthusiastic and experienced individual to join our research effort investigating genetic, molecular, and behavioral variation in neurological disease. This role is responsible for developing and implementing our multi-modal data architecture, including data storage, management, processing, and querying. This involves incorporating large-scale next-generation sequencing, imaging, clinical, and "omics" data into a unified framework.
The successful candidate will work closely with clinicians, research scientists, and bioinformaticians to design data storage, retrieval, and organizational solutions that open new avenues of research into neurological disease. The ideal candidate has a background in Computer Science, Data Science, Data Management, or a related field, with proven experience in managing large-scale data systems. Familiarity with next-generation sequencing data is preferred but not required.
The position offers a stimulating, multidisciplinary environment and the opportunity to work with a variety of researchers at Columbia University to tackle an important aspect of human health research. There will be many opportunities to contribute to multiple ongoing national and international collaborative projects.
1. Develop and implement a data architecture and management system to store and handle large-scale data from multiple modalities.
2. Work with bioinformaticians to structure data for quick processing through established bioinformatics pipelines.
3. Contribute to QA/QC of pilot and production data sets.
4. Display initiative and independence in providing rapid results to various investigators generating and querying experimental data.
5. Prepare summary reports of data and results for dissemination to colleagues and collaborators.
6. Directly respond to inquiries regarding projects being managed. Produce subsets of data for distribution to collaborators as approved by the principal investigators.  
Requires a bachelor's degree or equivalent in education and experience, plus four years of experience.  
-Demonstrated database development and maintenance skills
-Strong skills in managing and organizing large data sets
-Programming experience in a scripting language such as R, Python, or Perl
-Extensive experience with DDL, DML, and DCL, through a database language such as SQL
-Ability to work independently, display initiative within a team environment, and respond rapidly to requests  
Standard Posting  
Open Until Filled  
Columbia University is an Equal Opportunity/Affirmative Action employer.  
Columbia University is committed to the hiring of qualified local residents.  