Data Engineer


Company 

Cranleigh STEM, Sustainability & SHEQ Recruitment

Location 

cambridgeshire

Employment Hours 

Full Time

Employment Type 

Permanent

Salary 

Job Requirements/Description

Role Overview:

We are seeking a skilled Data Engineer to join our Data Team and manage our data infrastructure. In this role, you will play a crucial part in streamlining data flow across the organization by integrating data between teams. You will be responsible for overseeing data flow management and maintaining cloud infrastructure to support our sequencing projects and downstream data analysis.


At Origin Sciences, we utilize advanced sequencing technologies to analyze mucus-based biospecimens, generating large volumes of data. The Data Engineer will manage the cloud infrastructure that supports these sequencing projects, enabling both clinical analytics and BI reporting.


Main Duties & Responsibilities:

  • Manage and optimize cloud resources to handle large-scale sequencing data
  • Implement infrastructure improvements to enhance usability, performance, and security
  • Automate data collection processes from laboratory instruments
  • Use ETL processes to centralize organizational data
  • Provide ad-hoc engineering support to laboratory and clinical teams


Skills & Qualifications:

  • Bachelor’s degree in a relevant technical field or equivalent experience
  • Proficiency in Python, R, or other programming languages for data processing
  • Experience managing and configuring cloud infrastructure and resources
  • Knowledge of cloud security best practices
  • Experience integrating APIs for process automation
  • Familiarity with containerization tools (Docker, Singularity)
  • Experience with schedulers such as AWS Batch, GCP Batch, or Slurm
  • Proficiency in Git and version control
  • Ability to critically assess data-handling practices in a commercial R&D setting
  • Understanding of data management best practices


Desirable Skills:

  • Associate-level cloud certification
  • Knowledge of SQL and relational databases
  • Understanding of UK GDPR requirements for processing human genomics data

Company 

Cranleigh STEM, Sustainability & SHEQ Recruitment

Location 

cambridgeshire

Employment Hours 

Full Time

Employment Type 

Permanent

Salary 

An error has occurred. This application may no longer respond until reloaded. Reload 🗙