Data Scientist Intern / Senior Data Scientist Intern
Job Description
Position Description:
Mathematica applies expertise at the intersection of data, methods, policy, and practice to improve well-being around the world. We collaborate closely with public- and private-sector partners to translate big questions into deep insights that improve programs, refine strategies, and enhance understanding. Our work yields actionable information to guide decisions in wide-ranging policy areas, from health, education, early childhood, and family support to nutrition, employment, disability, and international development. Mathematica offers our employees competitive salaries and a comprehensive benefits package, as well as the advantages of being 100 percent employee owned. As an employee stock owner, you will experience financial benefits of ESOP holdings that have increased in tandem with the company's growth and financial strength. You will also be part of an independent, employee-owned firm that is able to define and further our mission, enhance our quality and accountability, and steadily grow our financial strength. Learn more about our benefits here.
At Mathematica, we take pride in our commitment to diversity. Building an inclusive culture that draws on the individual strengths of employees from different ethnic backgrounds, cultures, lifestyles, abilities, and experience is key to our success.
We are looking for Undergraduate & Masters-level Data Scientist interns and PhD-level Data Scientist interns to support data processing and analysis tasks, such as building data pipelines, monitoring data quality, developing documentation, applying statistical and data science methods, and creating data visualizations during a 10-week internship program, starting June 5th and ending August 11th. Data Science interns will be paired with a mentor during their internship to gain insight into Mathematica's health policy work and how data supports it. Our data scientists underpin our company's core offerings in health policy program improvement and program assessment, which yield crucial evidence and information for policy and decision makers. This position can work remotely or in any of our offices, including Princeton, NJ; Washington, DC; Woodlawn, MD; Ann Arbor, MI; Cambridge, MA; Chicago, IL; Oakland, CA; or Seattle, WA. Mathematica also offers hybrid work options.
This position focuses on health policy, which includes projects such as:
- Monitoring the impacts of an alternative payment model for primary care in terms of care quality, cost, and health outcomes for diverse beneficiaries, using claims from millions of beneficiaries across the country and predicting future hospital costs and utilization.
- Developing and testing how claims and survey data from federal and state-level programs could be used to measure patients' experience of care, quality of life, care coordination, and long-term outcomes for beneficiaries enrolled in both Medicare and Medicaid.
- Creating an interactive data visualization tool to help local policy and decision makers understand how social determinants of health are related to health outcomes in their county, using open-source data from public agencies and non-profits.
- Building products/tools that integrate a suite of classic and novel methodologies used in Mathematica's projects, e.g., matching/weighting methods for impact analysis, measure reliability/validity testing
- Co-developing analysis plans with researchers and other data science colleagues
- Writing and maintaining production-level programming systems to obtain, combine, transform, store, and analyze datasets on cloud, internal, and client servers
- Developing and maintaining documentation
- Implementing reproducible research and quality assurance practices, such as environment management, version control, and testing
- Conducting analysis and communicating results, both to internal teams and clients, such as descriptive statistics, data visualizations, and model diagnostics
Position Requirements:
For Undergraduate & Masters-level Data Scientist interns:
- Currently enrolled in or recently completed a master's program, or bootcamp, with an academic record including courses in subjects such as statistics, data science, data analytics, mathematics, operations research, computer science, and/or social science; equivalent years of experience can be substituted
- Demonstrated interest and/or experience using programming and data science and/or statistics to contribute to projects with a policy/social impact in academic and/or professional settings
- At least two years of experience performing data cleaning and analysis using programming languages such as R, Python, or Julia in the academic, extra-curricular, or professional environment
- Experience executing data science and statistics techniques including machine learning algorithms
- Ability and desire to work independently as part of an interdisciplinary team that may be geographically dispersed. This includes being able to learn from resources such as self-guided tutorials, package documentation, and academic articles, and a willingness to constantly learn and contribute to knowledge sharing with team members.
- Experience with reproducible research principles, version control, interactive visualizations, and data science packages and libraries in at least one language:
  - R packages such as tidyverse, R Shiny, and/or R Markdown
  - Python packages such as numpy, pandas, and/or scikit-learn
  - Julia packages such as DataFrames and/or MixedModels
- Nice to have: experience with healthcare datasets (for example, Medicare or Medicaid claims and enrollment data), SQL, Natural Language Processing, product/tool development skills, production-quality machine learning applications, cloud computing environments, and algorithmic fairness and ethics
For PhD-level Data Scientist interns:
- Currently enrolled in or recently completed a PhD program with an academic record including courses in subjects such as statistics, data science, data analytics, mathematics, operations research, computer science, and/or social science; equivalent years of experience can be substituted
- At least three years of experience performing data cleaning and analysis using programming languages such as R, Python, or Julia in the academic, extra-curricular, or professional environment
- Experience executing programming, data science, and statistics techniques to contribute to projects with a policy/social impact in academic and/or professional settings
- Ability to take an ambiguous question, propose rigorous methods, use data to draw insights, and convey results and findings to a wide range of audiences.
- Ability and desire to work independently, with minimal guidance, as part of an interdisciplinary team that may be geographically dispersed. This includes being able to learn from resources such as self-guided tutorials, package documentation, and academic articles, and a willingness to constantly learn and contribute to knowledge sharing with team members.
- Demonstrated expertise in one of the following methodological areas: machine learning, agent-based modeling, natural language processing, causal inference, or Bayesian statistics.
- Experience with reproducible research principles, version control, interactive visualizations, and data science packages and libraries in at least one language:
  - R packages such as tidyverse, R Shiny, and/or R Markdown
  - Python packages such as numpy, pandas, and/or scikit-learn
  - Julia packages such as DataFrames and/or MixedModels
- Nice to have: experience with healthcare datasets (for example, Medicare or Medicaid claims and enrollment data), SQL, product/tool development skills, production-quality machine learning applications, cloud computing environments, and algorithmic fairness and ethics
Hourly pay rates:
- $22 per hour for Freshman, Sophomore, or between Sophomore and Junior years
- $23 for Junior year or later
- $24 for students in their first year of Masters or between first and second year
- $25 for students in their second year of Masters or later
In accordance with Executive Order 14042 and its implementing guidelines, all Mathematica employees must provide documentation that they have been fully vaccinated or obtain an accommodation through Human Resources by providing documentation from a licensed health care provider that they are unable to be vaccinated against COVID-19 because of a disability (which would include medical conditions) or provide an attestation that they are entitled to an accommodation because of a sincerely held religious belief, practice, or observance.
We are not working with staffing agencies to fill this position, and we will not accept unsolicited resumes. Please do not reach out directly to technical staff or leaders within Mathematica, as all questions from agencies go through the Talent Acquisition team.
Available Locations: Cambridge, MA; Princeton, NJ; Washington, DC; Woodlawn, MD; Ann Arbor, MI; Chicago, IL; Oakland, CA; Seattle, WA; Remote
We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state or local protected class.
Date Posted: 02/14/2023