Job Overview:
We are seeking an experienced
Data Scientist
to join our growing data team. The ideal candidate will have a strong analytical mindset, experience with large and complex datasets, and a solid foundation in statistical modeling and machine learning. You will play a critical role in uncovering insights, building data-driven solutions, and supporting strategic decisions across the organization.
Key Responsibilities:
-
Analyze large, complex structured and unstructured datasets to derive actionable insights.
-
Develop, validate, and deploy statistical and machine learning models.
-
Use SQL, Python, or R to manipulate data and draw insights from large datasets.
-
Design and build visualizations using tools like Power BI, Tableau, or Looker to communicate findings.
-
Implement parameter reduction techniques such as feature selection and PCA to optimize models.
-
Build and manage end-to-end data workflows in Dataiku (DSS) including data ingestion, preparation, modeling, and deployment.
-
Work with APIs and integrate external/internal data sources for enhanced analytics capabilities.
-
Apply LLM techniques including prompt engineering and fine-tuning to relevant business problems.
-
Experiment with and integrate vector databases and embedding-based approaches for LLM use cases.
-
Collaborate with cross-functional teams including product, engineering, and business stakeholders.
-
Ensure accuracy, consistency, and quality in data deliverables.
Requirements
Required Qualifications:
-
3+ years of experience as a Data Scientist, Data Analyst, or similar role.
-
Strong proficiency in SQL and Python or R.
-
Hands-on experience with data visualization tools (Power BI, Tableau, Looker, etc.).
-
Solid understanding of statistics, data modeling, hypothesis testing, and machine learning.
-
Experience with parameter reduction techniques (e.g., PCA, feature selection).
-
Proven experience working with APIs, large datasets, and both structured/unstructured data.
-
Experience using Dataiku (DSS) for building data pipelines and deploying models.
-
Familiarity with LLMs (prompt engineering, fine-tuning) and vector database technologies.
-
Bachelor's degree in Data Science, Statistics, Computer Science, Economics, or a related field.
-
Experience in the financial services or investment domain is a plus