Data Scientist @ Manchester, England, United Kingdom

David Springate

8 years ago

ROLE SUMMARY
Our data scientists will use their excellent statistical and data manipulation skills to develop new data products and improve data quality.

KEY RESPONSIBILITIES
Working as part of a data science team that performs:

Processing and manipulation of large and complex datasets. Fusion of disparate datasets to generate novel data products. Automation of these processes for efficiency
Validation and improvement of existing practices data quality and methodology. e.g statistical reporting on survey design, research meta-analyses over a variety of data sources and types, design and analysis of pilot studies (A-B testing)
Advanced analytics to extract meaningful information and generate new insights from big data. e.g. predictive modelling / machine learning, exploratory data analysis, social media mining / sentiment analysis

SKILLS & EXPERIENCE REQUIRED
Mandatory:

Good knowledge of R and/or Python programming
Experience in working with large, complex datasets
Statistical expertise (predictive modelling, inference, exploratory data analysis, experimental design)
Attention to detail
Ability to handle multiple responsibilities
Good time management and organisational skills
Keen to discover, learn, test and implement cutting-edge data science methodologies
Highly motivated and able to work with minimal supervision
Good interpersonal skills and able to communicate complex statistical concepts to non-statisticians
Bright attitude, willing to help other members of the team

Preferable:

SQL Database experience
Linux / bash scripting
Experience of advanced data processing, analysis and visualisation tools (e.g. dplyr, ggplot2, Pandas)
Knowledge of common machine learning algorithms, techniques and implementations
Knowledge of big data analytics frameworks (Spark, Hadoop etc.)
Understanding of data fusion / statistical matching methodology
Understanding of web protocols / HTTP / Web mining
Experience of survey design and/or working with survey data