Job Description
Role and Responsibilities:
* Perform advanced data quality control and analysis of public health-related data (e.g., large EMR datasets, large relational databases) using Python (PySpark) or R
* Cleanse data (data munging) and prepare datasets for analytics
* Perform statistical analyses and develop advanced data analyses, data visualizations, and applications in Python and/or R
* Develop multilevel statistical models for specific outcomes
* Understand, specify, process and present information from potentially several disparate data sets
* Generate actionable knowledge from data
* Prepare technical reports and routine data analysis reports using large scale data sets
* Assist in interpreting data analysis findings and offer solutions for issues identified
* Communicate and collaborate with other scientists on aspects of study analysis and interpretation
Required Skills & Experience
* Master's or doctoral degree in Statistics, Biostatistics, Engineering, Computer Science, Mathematics, or a similar field, including a minimum of 3 years of related work experience and graduate-level coursework in statistics
* Experience with statistical techniques (e.g., measures of central tendency, dispersion, variance, regression)
* Experience with cleaning data for analysis (i.e., data munging)
* Experience working with large real-world databases and identifying data gaps and inconsistencies (i.e., data validation and missingness patterns)
* 3-5 years of experience with quantitative analysis and data interpretation
* 3-5 years of experience programming in PySpark/Python for statistical analyses and data management
* Experience with a cloud platform, preferably Azure (Databricks, Azure Data Lake, Azure Data Factory, Python, and Power BI)
* Experience developing and supporting Python-based AI/ML solutions
* Experience in developing machine learning models and algorithms, including the use of the following Python machine learning libraries: NumPy, SciPy, scikit-learn, Theano, TensorFlow, Keras, PyTorch, pandas, and Matplotlib
Nice to Have Skills & Experience
* Federal Experience
* Public Health Experience
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401(k) retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.