Engility Data Science and Analytics/Machine Learning/Deep Learning Computational Scientist in Dayton, Ohio

About Engility:

Engility delivers innovative solutions to critical challenges facing the nation and the world. As a premier provider of integrated services for the U.S. government, we support the Department of Defense, intelligence community, space communities, federal civilian agencies and international customers. Engility is dedicated to making lives better, safer and more secure.


Engility is looking for a Data Science and Analytics/Machine Learning/Deep Learning Computational Scientist as part of the User Productivity Enhancement, Technology Transfer, and Training (PETTT) contract for the DoD High Performance Computing Modernization Program (HPCMP). Responsibilities include providing advanced, science-level on-site support to DoD HPC users in the area of Big Data Analytics, with a primary focus on Python-based tools for machine learning/deep learning, neural networks, and Bayesian methods, and on applied areas such as computer vision (image representation, object recognition, and caption generation).

The successful candidate will interact with members of the HPCMP and senior members of the DoD S&T complex to identify research areas where high performance data analytics (HPDA) can be applied to domains of interest to DoD, such as high throughput screening, automatic target recognition (ATR), autonomy, and other emerging areas. Upon identification of requirements and promising research approaches, the successful candidate will lead and work with other team members within the Data Science and Analytics team on the development and implementation of new workflow tools to promote the adoption of HPDA within the HPCMP community. This includes science gateways, interactive capabilities, databases, custom tools installation and integration, optimization, data wrangling, data formats and management, and data mining, as well as leveraging new technologies such as containers. The ideal candidate will have the ability to prototype new solutions that work in restricted environments on DoD HPC systems and troubleshoot underlying technologies such as network and login processes, web technologies, and data management frameworks. Travel (less than 10%) is required. Relocation assistance is available.

Required Qualifications:

  • Education: PhD in computer science, data science, or a related area.

  • Strong background in multiple data science areas, such as math/statistics, machine/deep learning, data mining, data management, web technologies, or databases.

  • At least 10 years of work experience.

  • At least 5 years of experience in engineering and machine learning.

  • Proficient in high-performance Python and Python-based data analytics tools such as parallel Python packages, Pandas, NumPy, SciPy, Scikit-learn, and Matplotlib.

  • Proficient in R and SQL.

  • At least 6-8 years of experience with databases (RDBMS and NoSQL) and multiple file formats.

  • At least 6-8 years of HPC experience in software development and tool enhancement, including work with PBS batch scheduling systems.

  • Strong background in distributed or parallel computing with experience in parallel Python packages, MPI, or other distributed computing frameworks.

  • Experience with deep learning frameworks, particularly TensorFlow or Caffe, but also others such as Keras, Theano, or Torch.

  • Excellent communication, presentation, collaboration and leadership skills.

  • U.S. citizenship required. Must hold a final Secret-level clearance to start.

Desired Qualifications:

  • Proficient in other languages that support data science, such as Java, JavaScript, Octave, MATLAB, or Ruby.

  • Experience developing and deploying container-based data analytics workflows, particularly parallel workflows such as those built on MPI frameworks.

  • Strong understanding of big data concepts and familiarity with Apache Big Data Stack tools, particularly Spark, but also others such as Kafka, Storm, Hadoop, Pig, Mahout, Hive, etc.

  • Experience with data analytics tools such as NeuroBayes or RapidMiner is a big plus.

  • Familiarity with agile software development practices.

