ROLE SUMMARY
Our data scientists will use their excellent statistical and data manipulation skills to develop new data products and improve data quality.
KEY RESPONSIBILITIES
Working as part of a data science team that performs:
- Processing and manipulation of large and complex datasets. Fusion of disparate datasets to generate novel data products. Automation of these processes for efficiency
- Validation and improvement of existing practices data quality and methodology. e.g statistical reporting on survey design, research meta-analyses over a variety of data sources and types, design and analysis of pilot studies (A-B testing)
- Advanced analytics to extract meaningful information and generate new insights from big data. e.g. predictive modelling / machine learning, exploratory data analysis, social media mining / sentiment analysis
SKILLS & EXPERIENCE REQUIRED
Mandatory:
- Good knowledge of R and/or Python programming
- Experience in working with large, complex datasets
- Statistical expertise (predictive modelling, inference, exploratory data analysis, experimental design)
- Attention to detail
- Ability to handle multiple responsibilities
- Good time management and organisational skills
- Keen to discover, learn, test and implement cutting-edge data science methodologies
- Highly motivated and able to work with minimal supervision
- Good interpersonal skills and able to communicate complex statistical concepts to non-statisticians
- Bright attitude, willing to help other members of the team
Preferable:
- SQL Database experience
- Linux / bash scripting
- Experience of advanced data processing, analysis and visualisation tools (e.g. dplyr, ggplot2, Pandas)
- Knowledge of common machine learning algorithms, techniques and implementations
- Knowledge of big data analytics frameworks (Spark, Hadoop etc.)
- Understanding of data fusion / statistical matching methodology
- Understanding of web protocols / HTTP / Web mining
- Experience of survey design and/or working with survey data