Principal Data Scientist Machine Learning for Biologics
Company: Sanofi
Location: Cambridge
Posted on: June 24, 2022
Job Description:
OverviewAt Sanofi Large Molecule Research Platform, we have a
strong ambition to utilize machine learning (ML) and artificial
intelligence (AI) in all aspects in biologics discovery and
development, with comprehensive programs spanning from deep
repertoire mining to de novo design. Additionally, we are open and
actively collaborate with leading biotech and academia. The new
opening senior data scientist role is to seek a machine learning
expert with strong expertise in modern data analysis methods,
including NLP, graph, and 3D geometry-aware models. The successful
candidate will work in an interdisciplinary team to apply
cutting-edge computation, ML/DL, sequence-based and structure-based
methods to resolve challenges in real-world drug discovery. This is
an exciting opportunity to contribute to the process of design and
engineering revolutionary biologics, including multivalent,
multi-targeting molecules by leveraging large amount of proprietary
data coming from our industry-leading high-throughput automation
platforms. The successful candidate will gain deeper insight into
drug development process and transform ML/DL algorithms into
accelerating biologics discovery and development process, and make
impacts to patients' life.Key Responsibilities
- Evaluate and develop state-of-the-art computational methods to
decode biophysical and geometrical features from antibody-antigen
datasets and create predictive models for engineering.
- Develop and apply complex machine/deep learning solutions to
our high-content and high-quality proprietary datasets, as well as
public datasets.
- Perform data querying and feature extraction to improve current
workflow for antibody/nanobody engineering, including affinity
modification, cross-reactivity engineering, liability risk
prediction and mitigation, multi-specific antibody engineering, and
de novo antibody design.
- Maintain a keen awareness of recent developments in data
science, bioinformatics, and state-of-the-art of ML/DL algorithms,
aiming to accelerate development of new computational
algorithms.
- Effectively collaborate with colleagues with diverse scientific
background, identify problems and opportunities, combine
computational and structural analysis to support large molecule
projects.Basic qualifications:
- Ph. D. in related field such as Biostatistics, Physics,
Computation Biology, Biomedical Engineering, Computer Science,
Applied Mathematics, Structural Biology with at least 5 years of
experience or Master with at least 7 years of relevant
experience.
- Significant depth of expertise in ML/DL, hands-on experience
with modern machine learning models, including Transformers, Graph
NN, Recurrent NN, MLP etc.
- Sufficient in programing using deep learning libraries such as
PyTorch, TensorFlow, and Keras.
- Experience with cloud computing, parallel computing, and/or
supercomputing is expected.
- Track record of applying machine learning/ deep learning
approaches to solve molecule-related problems.
- Familiarity with protein structure or sequence featurization
and learned embeddings.
- Experience with database mining, big data, and large-scale
virtual screening using Bayesian Optimization or Gaussian Processes
is desired.
- Familiarity with Data Visualization tools/libraries and
dimensionality reduction algorithms.Preferred qualifications:
- Understanding of protein structure and protein-protein
interaction.
- Experience with structural analysis and optimization of
biochemical and biophysical properties, like thermodynamics, with
protein design tools e.g. Rosetta, BioLuminate, or MOE, etc.
- Understanding of biologics R&D process is a plus.At Sanofi
R&D North America, we deliver meaningful solutions for
patients. We transform science into breakthrough, best-in-class and
first-in-class medicines and vaccines. We believe in creating a
diverse and inclusive workforce - and workplace - which brings
together the collective brainpower of over 2,000 colleagues and
provides you with an exciting place to grow and develop. We set the
bar high, and we deliver. Join us and together we will build on our
trusted legacy of breakthroughs for society.Sanofi Inc. and its
U.S. affiliates are Equal Opportunity and Affirmative Action
employers committed to a culturally diverse workforce. All
qualified applicants will receive consideration for employment
without regard to race; color; creed; religion; national origin;
age; ancestry; nationality; marital, domestic partnership or civil
union status; sex, gender, gender identity or expression;
affectional or sexual orientation; disability; veteran or military
status or liability for military status; domestic violence victim
status; atypical cellular or blood trait; genetic information
(including the refusal to submit to genetic testing) or any other
characteristic protected by law.As a healthcare company and a
vaccine manufacturer, Sanofi has an important responsibility to
protect individual and public health. All US based roles require
individuals to be fully vaccinated against COVID-19 as part of your
job responsibilities.According to the CDC, an individual is
considered to be "fully vaccinated" fourteen (14) days after
receiving (a) the second dose of the Moderna or Pfizer vaccine, or
(b) the single dose of the J&J vaccine. Fully vaccinated, for
new Sanofi employees, is to be fully vaccinated 14 DAYS PRIOR TO
START DATE.#GD-SA
#LI-SAPDNAt Sanofi diversity and inclusion is foundational to how
we operate and embedded in our Core Values. We recognize to truly
tap into the richness diversity brings we must lead with inclusion
and have a workplace where those differences can thrive and be
leveraged to empower the lives of our colleagues, patients and
customers. We respect and celebrate the diversity of our people,
their backgrounds and experiences and provide equal opportunity for
all.
Keywords: Sanofi, Cambridge , Principal Data Scientist Machine Learning for Biologics, Other , Cambridge, Massachusetts
Didn't find what you're looking for? Search again!
Loading more jobs...