The Data Scientist is responsible for conducting undirected research and tackle open-ended data problems and questions. Drawing on an advanced degree in a quantitative field such as computer science, physics, statistics or applied mathematics, the Data Scientist demonstrates the knowledge to invent new algorithms to solve data problems.
-
Research and assess next-generation technologies for machinery diagnostics and prognostics and data-driven modeling and optimization of complex systems.
-
Demonstrate advanced working knowledge and experience with machine learning algorithms and population-based meta-heuristic optimization methods.
-
Generate innovative ideas, establish new research directions, and shape and execute on technical projects.
-
Maintain state-of-the-art knowledge and contribute to technical discussions and reviews as an expert in related areas of responsibility.
-
Apply theoretical knowledge to solve industrial problems.
-
Process large multivariate data sets collected from equipment operations, manufacturing tests and diagnostic routines.
-
Apply engineering knowledge in developing data-driven algorithms for anomaly detection, failure prediction and optimization.
-
Communicate ideas, plans and results effectively via oral and written reports.
-
Collaborate with field and product engineers to identify key health monitoring parameters of a system.
Required Skills & Qualifications
-
Programming: High proficiency in Python (specifically Pandas for data manipulation and NumPy for numerical computing).
-
Database Management: Ability to write SQL queries to extract data from relational databases.
-
Statistical Foundation: Understanding of probability, hypothesis testing, and statistical significance.
-
Machine Learning: Familiarity with ML libraries like Scikit-Learn.
-
Education: Currently pursuing or recently completed a degree (BS/MS) in Data Science, Statistics, Computer Science, Computer Engineering, Mathematics, or a related field.
Bonus
-
Basic exposure to LLMs or Prompt Engineering (e.g., using OpenAI or LangChain APIs).
-
Creating Dashboards and visualizations (ex. PowerBI)