Data Scientist Technical Specialist
Location: Chantilly, VA
Clearance: TS with SCI eligibility
Looking for a talented Data Scientist , to analyzing production data which will lead to the program choosing right-fit technology solutions.
Work closely with entity resolution subject matter experts, program engineers and stakeholders to integrate complex data sources into operation and develop new analytics and algorithms
Support advanced data exploitation capabilities using Hadoop related technologies
Enhance the data processing infrastructure using open source and commercial technologies
Capture and define requirements for entity resolution features and functionality from a UI end-user perspective
Analyze documentation and source data of new data feeds
Provide O&M monitoring and resolution of production web services using monitoring tools including Ganglia, Jenkins, and other means
Modify pipeline scripts to enhance functionality and support bug fixes
Resolve bugs by modifying pipeline scripts and entity resolution explanation service scripts
Coordinate development efforts in an agile team
You have a Bachelor’s degree in Engineering, Computer Science, or other related analytical, scientific, or technical discipline. Equivalent experience may be considered in lieu of degree.
Have 10 years of experience with scripting languages, such as Python, used in support of development and production operations.
Have 10 years of experience analyzing and reporting on large datasets .
Experience with using Apache Spark to aggregate and analyze large data sets from various sources.
Experience with designing and implementing custom machine learning algorithms.
Experience with graph algorithms and semantic Web.
Experience with using SQL to conduct complex database queries.
Knowledge of descriptive and inferential statistics, including hypothesis testing.
Ability to apply supervised and unsupervised machine learning algorithms for clustering, classification, and dimensionality reduction.
Ability to communicate information in verbal or written formats to a senior executive audience.
Knowledge of commercial Cloud services used by the IC.
ABBTECH is an EOE/Minorities/Women/Disabled Individuals/Veterans
Develop and implement statistical predictive models and machine learning algorithms
Develop predictive models and machine learning algorithms using advanced methodologies
Develop statistical learning models for data analysis
Design and develop data mining and machine learning models and algorithms
Using machine learning and statistical techniques
Develop statistical models for data analysis
Demonstrate an advanced understanding of data mining, predictive analytics, machine learning and data visualization
Support analytics and machine learning model development
Implement data driven business solutions using advanced statistical methods and machine learning techniques
Clean and explore data and implement statistical/machine learning models
Create machine learning data models that develop new insights over time
Providing advanced predictive data analytics using big data and data science technology for healthcare innovation
Generate analytics on big data
Solve statistical, machine learning, analytical and data mining problems
Leverage statistical data modeling, data mining and machine learning techniques to provide solutions to new business problems
Deploy predictive models and/or machine learning algorithms on large static and/or streaming data sets
Optimizing classifiers using machine learning techniques
Building the coolest machine data analytics systems
Using various types of algorithms and machine learning modeling techniques
Define and develop machine learning and data mining strategies