Data Integration Specialist
The Enhanced Data and Insights Sharing (EDIS) initiative was newly created to accelerate learning and insights generation across Research and Development. A key capability identified as being critical to the success of EDIS is the integration and sharing of biomedical patient data across the organization. This capability requires the creation of data integration specialist roles. This specialist will be a scientific, technical and process expert for the annotation and integration of human cohort datasets generated across the whole organization or acquired externally in support of ongoing drug development. Data sources will include patient level clinical trial data, and biomarker data from laboratories in the department. The data integration specialist will integrate into a team of data analysis experts from the department and will work alongside the groups to understand and manipulate highly complex clinical and low/high dimension biomarker data sets in order to create analysis ready merged datasets (Data Marts). Proper annotation and integration of clinical and biomarker data should fully enable advanced analytics, discoverability, use and subsequent re-use of these important company assets. The data integration specialist will also be responsible for the upload of newly generated biomarkers data in databases.
- Track relevant clinical data and all planned and already-generated biomarker data for clinical trials (both active and closed), external human cohort studies and procured tissue collections and create Data Marts.
- For all Data Mart instances and together with curators, ensure clear annotation; descriptions of the nature and provenance of source data; reproducibility of analyses; and a list of contacts, including the primary bioinformatician, biostatistician and biomarker scientist as well as the individual granting access authorization.
- Generate study specific Data Mart and annotations that are optimized for data analysis tools and data aggregation whenever project or across project-level analyses are required.
- For newly generated biomarker data, discuss with laboratory scientists (from the company or external vendors) the data format requirements.
- Upload of newly generated biomarkers data in databases.
- Support the guidelines and policies on the decision-making process. Contribute to elaboration of best practices for data generation, aggregation, shared data use, and data exploration.
Qualifications and Experience:
- MS plus three years or PhD in Life Sciences, Bioinformatics, Biostatistics, or Data Science, with at least two years of experience in the pharmaceutical or diagnostics industry.
- Prior experience analyzing biomarker data and a proven track record of working with human cohort data sets, preferably high dimension (-omics) data sets, with a desire to apply this knowledge to existing and new scientific challenges.
- Experience with clinical data and knowledge of STDM and CDISC formats
- Knowledge of the UNIX/Linux operating system and demonstrated proficiency in R.
- Good knowledge of basic biological principles and the pharmaceutical or diagnostic development process.
- Systematic, structured, and pragmatic approach to tasks, with good attention to details.
- Able to deal positively and patiently with multiple requests and priorities.
- Drive for results (demonstrates interest and ability to learn new things, takes initiative, welcomes problems as challenges, finds solutions to technical problems, persists in the face of diversity).
- Great team player with strong written and oral communication, collaboration, and organizational skills.
- Familiarity with LIMS systems, relational databases, data visualization tools (e.g., Spotfire) would be desirable.
- Fluency in written and spoken English
- Relevant working/residency permit or Swiss/EU-Citizenship required