Cadre Job Opening

DATA SCIENTIST

Role Description: Data Scientist


Cadre AI is looking for a talented Data Scientist to join our dynamic team and help drive data-driven decision-making and the development of AI-powered solutions. We’re seeking a candidate with strong technical expertise in data analysis, machine learning, and statistical modeling who has a proven record of deriving insights from complex datasets and building scalable data solutions.


We are seeking a talented Data Scientist with expertise in Generative AI, proficiency in working with Large Language Models (LLMs) such as OpenAI's GPT series, Anthropic's Claude, Google's Gemini, and familiarity with open-source models like Llama. Additionally, you should have strong analytical skills and experience in model evaluation and data-driven experimentation. In this role, you will design, develop, and implement data models, algorithms, and analytical tools across various projects within the company. Your work will span multiple domains, allowing you to collaborate with cross-functional teams and tackle a wide range of data challenges.


Responsibilities:


  • Data Analysis and Modeling: Analyze large and complex datasets to extract actionable insights and inform decision-making.

    • Develop predictive and prescriptive models using machine learning techniques, with a focus on Generative AI and LLMs.

    • Build models for use cases such as customer segmentation, recommendation systems, time-series forecasting, and anomaly detection.

  • Experimentation and Model Evaluation: Design and conduct experiments to validate hypotheses and optimize model performance.

    • Evaluate ML models and LLMs using appropriate metrics, conducting rigorous testing to ensure quality and reliability.

  • Data Preparation and Feature Engineering: Collect, preprocess, and transform raw data into clean and structured formats suitable for analysis and modeling.

    • Engineer and select relevant features to improve model performance.

  • Cross-Functional Collaboration: Work closely with software engineers, AI engineers, product managers, and other stakeholders to integrate data science solutions into products and services.

    • Provide data-driven insights to inform product strategy and business decisions.

  • Research and Innovation: Stay up-to-date with the latest advancements in data science, machine learning, and AI.

    • Explore and implement cutting-edge techniques and tools to improve our data capabilities.

  • Data Visualization and Communication: Create compelling data visualizations and dashboards to communicate insights effectively to both technical and non-technical stakeholders.


  • Present findings and recommendations in a clear and concise manner.


  • Deployment and Monitoring: Collaborate on deploying models and analytical tools into production environments.

    • Monitor performance and make iterative improvements based on real-world results.


Qualifications

  • Educational Background: Bachelor's or Master's degree in Data Science, Computer Science, Statistics, Mathematics, or a related field.

  • English B2 or above

  • Experience: 5+ years of experience as a Data Scientist or in a similar role.

  • Experience with LLMs and Generative AI: Hands-on experience with LLMs such as OpenAI GPT-4, Anthropic's Claude, Google's Gemini, or open-source models like LLaMa.

    • Experience leveraging LLMs for data augmentation, text generation, or natural language understanding tasks.

  • Machine Learning and Statistical Skills: Strong understanding of machine learning algorithms, deep learning, and statistical methods.

    • Ability to choose appropriate models for specific business problems (e.g., regression, classification, clustering, etc.).

  • Programming and Data Engineering: Proficiency in Python, SQL (Postgres), and experience with machine learning frameworks such as Scikit-learn or PyTorch.

    • Familiarity with data pipeline tools and data warehouse platforms like Snowflake.

  • Problem-Solving Skills: Strong analytical and critical thinking skills to solve complex data challenges.

  • Communication: Excellent verbal and written communication skills for effective collaboration and presentation.

  • Team Player: Ability to work collaboratively in a multidisciplinary environment.


Preferred Qualifications


  • Cloud Experience: Familiarity with cloud platforms like AWS, Google Cloud, or Azure for deploying data solutions.

  • ETL and Data Warehouse: Experience with ETL frameworks like Spark/DBT and setting up data warehouses (Snowflake)

  • Domain Knowledge: Experience in industries such as finance, healthcare, retail, or technology is a plus.

  • Open-Source Contributions: Contributions to open-source data science or machine learning projects.


If you're passionate about using data to drive impactful decisions and eager to work on cutting-edge AI technologies, we’d love to hear from you! Join Cadre AI and contribute to shaping the future of data innovation.