Job ID: JOB_ID_4738
Job Summary
We are looking for an experienced Senior Data Engineer with specialized knowledge of the TetraScience platform to design, build, and manage robust data pipelines. This role is crucial for processing scientific and laboratory data, integrating it into a cloud data platform for advanced analytics and AI initiatives. The ideal candidate will work closely with lab systems, instruments, scientists, and data scientists to ensure seamless data flow and high data quality.
Key Responsibilities
- Design, develop, and implement data pipelines specifically on the TetraScience platform.
- Ingest and process diverse lab instrument data from various systems including LIMS (Laboratory Information Management Systems), ELN (Electronic Lab Notebooks), and CDS (Chromatography Data Systems).
- Write efficient Python scripts for parsing, transforming, and cleaning raw scientific data files.
- Store, manage, and organize data within cloud data lakes or data warehouses, ensuring accessibility and integrity.
- Perform rigorous data validation and quality checks to maintain high standards.
- Collaborate effectively with scientists, data scientists, and IT teams to deliver comprehensive data solutions that meet business needs.
- Ensure strict adherence to data compliance and security protocols, especially within regulated environments.
- Monitor and optimize the performance of data processing workflows and pipelines.
- Troubleshoot and resolve issues related to data ingestion, processing, and storage.
- Document all aspects of the data pipelines, processes, and solutions.
Required Skills
- 10+ years of progressive experience in Data Engineering.
- Strong proficiency in Python programming and SQL for data manipulation and querying.
- Demonstrated experience with the TetraScience Data Platform.
- Proven ability in building and managing ETL/ELT pipelines.
- Knowledge of Databricks and Apache Spark for big data processing.
- Hands-on experience with major cloud platforms such as AWS, Azure, or GCP.
- Solid understanding of data lake and data warehouse concepts and architectures.
Good to Have Skills
- Experience within the Pharma or Life Sciences domain.
- Familiarity with LIMS, ELN, and CDS systems.
- Understanding of GxP (Good Practice) or other regulatory compliance standards.
- Experience working directly with scientific instrument data.
Employment Details
- Job Location: Boston, MA
- Employment Type: Hybrid
- Experience: 10+ Years
Special Requirements
Hybrid, Pharma/Life Sciences domain experience preferred, GxP/regulatory compliance knowledge preferred
Compensation & Location
Salary: $120,000 – $170,000 per year (Estimated)
Location: Boston, MA
Recruiter / Company – Contact Information
Email: i@thinklusive.net
Recruiter Notice:
To remove this job posting, please send an email from
i@thinklusive.net with the subject:
DELETE_JOB_ID_4738