Position Title: Data Scientist
Company Overview: Capgemini Engineering is a global leader in engineering services, bringing together a worldwide team of engineers, scientists, and architects to assist the most innovative companies in unleashing their potential.
Position Overview: We are seeking a skilled Data Scientist with expertise in Cognite Data Fusion, data modelling, Unified Namespace (UNS), ontologies, and the identification of data products and datasets. The ideal candidate will have a strong background in developing and implementing data science projects, analyzing large and complex data sets, and driving data-driven decision-making across the organization
Key Responsibilities:
-
Solution Development: Design, implement, and deploy scalable data solutions utilizing Cognite Data Fusion, focusing on data modeling, UNS, and ontologies to address industry-specific challenges.
-
Data Analysis: Analyze large and complex data sets to identify trends, insights, and opportunities, supporting solution development and business strategies.
-
Collaboration: Collaborate with cross-functional teams to understand data needs and translate them into data science solutions, ensuring seamless integration and operationalization of digital solutions across various domains.
-
Client Engagement: Engage with clients to understand their business objectives, lead discovery workshops, and provide expert guidance on data-driven strategies and potential challenges.
-
Visualization: Develop dashboards and visualizations using tools such as Power BI, Grafana, or web development frameworks like Plotly Dash and Streamlit to effectively communicate data insights.
-
Mentorship: Provide guidance and mentorship to junior team members, promoting best practices in data science and software development.
Qualifications:
-
Educational Background: Master’s or PhD degree in a quantitative field.
-
Experience: Minimum of 2 years of experience in data science, with a strong background in developing analytical solutions within domains such as pharma, oil and gas, manufacturing, or power & utilities.
-
Technical Skills: Proficiency in Python and its data ecosystem (pandas, numpy), machine learning libraries (scikit-learn, keras), and experience with SQL.
-
Visualization Tools: Experience with data visualization tools like Power BI, Grafana, Tableau, or web development frameworks such as Plotly Dash and Streamlit.
-
Software Practices: Strong understanding of software development practices, including version control (e.g., Git), automated testing, and documentation.
-
Cloud Platforms: Experience with cloud services such as GCP, Azure, or AWS is advantageous.
-
Domain Knowledge: Familiarity with industrial data management concepts, including Unified Namespace (UNS), ontologies, and data product identification.
-
Communication Skills: Excellent communication and collaboration skills, with the ability to work with cross-functional teams and stakeholders.
-
Leadership: Demonstrated ability to lead projects and mentor junior team members.
Preferred Qualifications:
-
Industry Expertise: Experience serving as a domain expert on internal or customer projects within relevant industries.
-
Cloud Deployment: Experience deploying models and solutions in production environments using cloud infrastructure.
-
Continuous Learning: Willingness to stay updated with the latest developments in data science and related technologies.