Yale School of Public Health (YSPH) is seeking a Data Scientist to join the Public Health Data Science and Data Equity research team with a primary home in the Department of Biostatistics. The Data Scientist will work with Dr. Bhramar Mukherjee, PhD., the inaugural Senior Associate Dean of Public Health Data Science and Data Equity, Anna M. R. Lauder endowed Professor of Biostatistics, Professor of Chronic Disease Epidemiology, Professor of Statistics and Data Science, with a focus on creating a... more details
Yale School of Public Health (YSPH) is seeking a Data Scientist to join the Public Health Data Science and Data Equity research team with a primary home in the Department of Biostatistics. The Data Scientist will work with Dr. Bhramar Mukherjee, Ph.D., the inaugural Senior Associate Dean of Public Health Data Science and Data Equity, Anna M.R. Lauder endowed Professor of Biostatistics, Professor of Chronic Disease Epidemiology, Professor of Statistics and Data Science, with a focus on creating and maintaining software packages, providing user support and training in coding and software package writing, and providing statistical analysis for research projects.
Responsibilities include develop and execute new and/or highly complex machine learning algorithms and statistical models and determine analytical approaches and modeling techniques for analysis of complex healthcare data. Establish analytical rigor and statistical methods to analyze large amount of data, using advanced statistical techniques and mathematical analyses. Manage analytical projects through the research cycle from data exploration, model building, performance evaluation, through implementation.
Develop shiny/JAVA apps, visualization tools and R-packages to support investigators at YSPH and facilitate their submissions to R CRAN. Develop work plans and monitor progress and project timelines. Document coding and changes to work plans using established work group methods in GitHub. Interact with a multidisciplinary team of internal and external peers to regularly, effectively, and openly communicate progress and outcomes of planned work.
Attend weekly meetings to discuss team and project-related activities, issues, change, communications, and updates. Participate in preparation of grants and papers. Assist with data use agreements and dataset licenses. Support Yale SPH faculty and students with high performance computing and coding assistance. Develop short courses and workshops on using AI tools for coding and visualization. Prepare analytic datasets from a secure portal that stores data from large biobanks. Strong strategic, analytical, and problem-solving skills. 1. Develop and execute new and/or highly complex algorithms and statistical predictive models and determine analytical approaches and modeling techniques to evaluate potential future outcomes. 2. Establish analytical rigor and statistical methods to analyze large amount of data, using advanced statistical techniques and mathematical analyses. 3. Manage analytical projects from data exploration, model building, performance evaluation, through implementation. 4. Develop work plans and monitor progress and project timelines. 5. Document coding and changes to work plans using established work group methods in GitHub. 6. Interact with a multidisciplinary team of internal and external peers to regularly, effectively, and openly communicate progress and outcomes of planned work. 7. Attend weekly team meetings to discuss team and project-related activities, issues, change, communications, and updates. Master’s Degree in computer science, applied/computational mathematics, engineering, biostatistics, statistics, or a quantitative field such as astronomy or geology, economics, health policy, health services research, public policy, or a related field and two years of proven experience or an equivalent combination of education and demonstrated experience.