Note: I am still working on this…

Merck-Purdue Data Mine Collab: Where do the undergrads go?”

2022 - Purdue Industrial Math and Stat Seminar

Presenter Title Affilation
Terri Bui, PhD Data Science & Informatics – Merck Research Labs  

Description:

As big data and analytics are being adopted across industries – Merck is also following suit. Here we present a method of engagement for academia and industry developing agile solutions and platforms to expedite the integration of data science into pharmaceutical sciences. In this engagement, students and Merck researchers collaborate by exploring technologies pertinent to big-data strategies that will enable data-driven research decisions through the development, design, and implementation of:

  1. a biometric workflow and integration platform
  2. radio-frequency (RFID) software solution
  3. natural language processing (NLP) on scientific text;

While these technologies have been readily adopted in other industries, their usage within the clinical and pharmaceutical research and development sector is still being explored from a compliance and integration perspective. Thus, the value provided in utilizing Merck-Purdue model of engagement by exploring new libraries and packages within the framework of general industrial ‘use cases’ can serve as preliminary ‘data’ for early adoption and operationalization.

Future works involve expanded areas of research within MRL and Purdue Faculty involvement.

Data Science and Statistics in Pharmaceutical Engineering and Experimental Studies: Where is it, where is it going?

2021 Joint Statistical Meetings (JSM) - Physical and Engineering Sciences - Pharmaceutical Research and Manufacturers of America

Panel Title Affiliation
Stan Altan, PhD Sr. Director & Fellow Johnson & Johnson
Mark Daniel Ward, PhD Prof. of Statistics and (by courtesy) of Mathematics and Public Health Purdue University
Terri Bui, PhD Sr. Scientist Merck & Co.,
Ke Wang, PhD Director Pfizer Inc.
Jonathan E. Allen, PhD Principal Investigator Lawerence Livermore National Laboratory
Huanyu Zhou, PhD Sr. Director Teva Pharmaceuticals

Data Science (DS) as a discipline seeks to bring new intensive algorithmic, data-driven approaches and insights. DS can increase scientific understanding and can elucidate essential relationships from large, complex, structured or unstructured data. The tools of DS are highly computational.

Students who learn DS need to go beyond traditional statistical approaches and design of experiment principles. In pharmaceutical engineering and research, large datasets are found throughout the drug discovery and development process, in both clinical and non-clinical studies. Research at the boundary of Statistics, DS, and Pharmaceutical Engineering is a fertile area for the application of DS approaches, related to Artificial Intelligence, Machine Learning, Computational Statistics, and related methodologies. Standard traditional statistical approaches are giving way to these newer methods. Consequently important questions arise regarding the respective roles of Statistics and DS, with emphasis on statistical engineering and experimental studies in the CMC and other areas.

The invited panel consisted of 5 panelists who are experienced data science practitioners discussing the following questions:

  1. What is DS, and how does it differ from traditional statistics?
  2. What are the appropriate roles of DS and Statistics in furthering a deeper understanding of Pharmaceutical Science and Engineering?
  3. What are the hurdles preventing the Pharmaceutical Sciences from utilizing data science tools?
  4. How can we improve the communication and collaboration capabilities, between Pharmaceutical domain experts and DS/Statistics colleagues?
  5. Where will we be 10 years hence?

Title: Integrating Data Science and Statistics in Pharmaceutical Statistical Engineering and Research

JSM 2020: Joint Statistical Meetings 2020 - Physical and Engineering Sciences

Type: Panel

Presenter Affilation
Stan Altan (Presenting) Johnson & Johnson
Brad Evans Pfizer
Terri Bui Merck & Co., Inc
Mark Ward Purdue University

Abstract: Data Science (DS) as a discipline seeks to bring new approaches to further scientific understanding and elucidate essential relationships with an emphasis on “large” datasets. The tools of DS are highly computational and do not follow traditional statistical approaches. In pharmaceutical engineering and research, large datasets are found throughout the drug development process, in both clinical and nonclinical studies. Therefore these represent fertile areas for the application of DS approaches, related to AI, ML and others. Standard traditional statistical approaches are giving way to these newer methods, and consequently important questions arise regarding the respective roles of Statistics and DS. This round table seeks to bring together individuals from both disciplines to discuss the following questions among others:

  1. What is DS, how does it different from statistics?
  2. What are the appropriate roles of DS and Statistics in furthering better understanding of pharmaceutical scientific and engineering studies?
  3. What are the hurdles preventing the Pharmaceutical Sciences from utilizing data science tools?
  4. How does one build a business case for DS approaches?