Location Burlington, North CarolinaJob ID 19-87759
The Center of Excellence (CoE) in Data Science and Bioinformatics at LabCorp Information Technology department applies various data science disciplines (including artificial intelligence, machine learning, graph databases, statistics, bioinformatics, and natural language processing) to our clinical, operational and financial challenges, and creates opportunities to enhance the value of our offerings to our customers. Additionally, it integrates the data sciences and bioinformatics efforts between the LabCorp diagnostics and drug development units and serves as a collaboration platform to foster teamwork and learning throughout the LabCorp IT organization.
The internship program in this CoE provide a unique opportunity for the students to interact with the CoE personnel and get hands-on experience and knowledge of solving real life problems in data sciences and bioinformatics. It also contributes directly to LabCorp research and development efforts to address challenging data science issues and speed up critical production development efforts.
Interns typically work on specific bioinformatics projects under the guidance of CBI members. A few examples of typical projects are:
Data mining and predictive modeling of diseases using the laboratory test data on over half the US population
Data Mining and Deep Learning for Non-Coding Regulatory Variants: Human genetic mutations in the protein-coding regions are responsible for only a small number of diseases, while 90-95% of human genetic diseases are caused by non-coding and regulatory variants. Therefore, better identification of non-coding regulatory elements and understanding of the impact of regulatory variants could have major impact for leveraging genomic data for improving human health.
Application of novel genomics and bioinformatics tools to better characterize the Human Leukocytes Antigen (HLA) alleles and apply it to improve survival rates in transplantation.
Copy Number Variation (CNV) Algorithm Bakeoff Project: Evaluate the performance of these tools to identify the best one to incorporate into the genetic tests offered by Labcorp
Extending the analytics and visualization of capabilities of in-house developed Integrated Data and Analytics Portal (IDAP) to include somatic variant classification and include CNV calls from all NGS assays
Hybrid On-Premise-Cloud Bioinformatics Workflow: Develop a workflow that seamlessly integrates the results of analysis carried out in the cloud with those generated in-house
Education and qualifications
Pursuing a BS, MS, or Ph.D. degree in computer science, bioinformatics or related discipline
Enjoy solving problems; inquisitive and analytical; engaged and motivated
Able to work both independently and as part of a larger team
Ability to see the big picture and work towards that goal
You have a basic understanding of genomics and biology
You are familiar with machine learning; for example, using TensorFlow for deep learning and convolutional neural networks
You know the difference between a fastq file, a BAM file and a VCF file.
You have programming experience in Python, Java, R, or Perl.
You are familiar with public bioinformatics resources (NCBI)
Some experience with cloud computing (e.g., Amazon AWS, Google Cloud, Microsoft Azure) is desirable
Interns from previous years have successfully moved on to industry positions pursuing careers in computation/bioinformatics analysis or to acquire advanced education in computation.