Data Scientist - GenAI Models Job at VDart Inc, Remote

NmhVTTJKQUQyZ2dLTXVXamJUSkR5bWw1dFE9PQ==
  • VDart Inc
  • Remote

Job Description

Job Title : Data Scientist GenAI Models and AI Agentic Systems

Location : Remote candidates are fine

contract

Look for senior resource

Position Overview:

We are seeking a Data Scientist with strong expertise in evaluating open-source Generative AI models, creating

compelling data visualizations, and designing agentic AI workflows. This is a unique opportunity to work on the

frontier of Generative AI, validating LLMs and agents that power critical enterprise Legal use cases. This position

will work within the Legal department. While there is some flexibility in location, work hours for this position are

expected to largely align to US standard working days.

The ideal candidate is fluent in the nuances of GenAI model performance, understands both open-source models

(e.g., LLaMA 3, Mistral, Falcon, Gemma) and enterprise-grade models like Granite, and is comfortable

developing multi-agent orchestration pipelines using frameworks such as LangGraph, CrewAI, AutoGen, or

LangChain Agents. You will also collaborate with MLOps teams to deploy and evaluate models using platforms

like Hugging Face, MLflow, and Weights & Biases.

Key Responsibilities:

Model Validation & Evaluation

Design rigorous evaluation pipelines for GenAI models, including IBM Granite models and open-source

alternatives hosted on Hugging Face Hub.

Define metrics and build test suites to assess model behavior across factuality, coherence, bias, safety,

hallucination, and robustness.

Leverage tools such as Ragas, TruLens, Giskard, or LM Eval Harness to compare performance across

prompts, tasks, and domains.

Data Visualization & Insight Communication

Create compelling dashboards and visual narratives to track model performance, error distributions, and

drift over time.

Use libraries like Plotly, Dash, Seaborn, or Matplotlib to build visuals that support real-time and

retrospective evaluations.

Summarize and present findings to cross-functional teams, including product owners, compliance

stakeholders, and LLM developers.

Agentic Frameworks & AI Workflows

Develop and validate multi-agent systems using frameworks like LangGraph, CrewAI, LangChain

Agents, or AutoGen.

Optimize agent roles, memory, and tool selection for task-oriented pipelines in legal, enterprise, or research

domains.

Evaluate how different orchestration strategies (e.g., sequential vs. parallel, static vs. dynamic role

assignment) affect task quality and efficiency.

Required Qualifications:

3+ years of experience in data science, NLP, or ML research roles, or equivalent academic credentials.

Practical experience evaluating open-source LLMs (e.g., LLaMA, Mistral, Falcon, Gemma) and

familiarity with IBM Granite model capabilities.

Fluency in Python and data science tools: Pandas, NumPy, Scikit-learn, Jupyter, and at least one

visualization library.

Familiarity with GenAI evaluation frameworks and benchmarks (e.g., TruthfulQA, MMLU, BBQ,

Hellaswag).

Experience with versioned model repositories and libraries via Hugging Face Transformers and Datasets.

Minimum Master's degree in computer science, Data Science, math or related

Preferred Qualifications:

Demonstrated experience with agentic frameworks (e.g., LangGraph, CrewAI, AutoGen, or LangChain

and Langflow Agents).

Knowledge of Retrieval-Augmented Generation (RAG), vector databases (e.g., FAISS, Weaviate,

Chroma), and hybrid retrieval strategies.

Exposure to prompt tuning, chain-of-thought prompting, function/tool calling, and memory-aware agents.

Prior work in regulated environments with a focus on explainability, auditing, or trustworthy AI.

Experience deploying and monitoring models in cloud platforms (e.g., AWS SageMaker, Azure ML, IBM

watsonx.ai

Job Tags

Contract work, Remote work,

Similar Jobs

EssilorLuxottica Group

P745 Sunglass Outfitters in Bass Pro Shops Sales Associate Job at EssilorLuxottica Group

 ..., Oliver Peoples, Vogue Eyewear and Costa), as well as a network that offers consumers high-quality vision care and best-in-class shopping experiences (such as Sunglass Hut, LensCrafters, Salmoiraghi & Vigan and the GrandVision network), and leading e-commerce platforms... 

Yexgo

Remote Data Entry Clerk - Typing - Part Time Entry Job at Yexgo

About The JobThis is your opportunity to begin a lifelong profession with unlimited opportunity. Discover the flexibility you've been looking for by taking a minute to finish our online application.Minimum requirements:BenefitsExcellent weekly paySafe work environment...

Bill & Tim's Barbeque & Tap House

Dishwasher Job at Bill & Tim's Barbeque & Tap House

 ...Come join a great company with a 30-year history in Oregon! DickieJo's Burgers is a fast casual...  ...average total hourly income of $16.70 to $17.20 per hour (possibly more with relevant...  ...required. You must be at least 17 years old to apply and 18 years old to apply for cashier... 

Pristine Painting and Coatings

Floor Coatings Installer Job at Pristine Painting and Coatings

 ...Feeling Unappreciated? Join Our 'Ohana Are you a skilled polyurea, polyaspartic flake floor installer who feels your hard work goes unnoticed? Are you tired of being just another face and not a valued member of the team? At Pristine Painting & Coatings, we understand... 

Spectrum

2026 Summer Intern: Associate Data Analyst Job at Spectrum

 ...within the telecommunications industry in your field of choice. Internship program runs from May 27 through July 31, 2026. In addition...  ...Leveraging analytical skills to extract and evaluate data to deliver actionable insights that support customer operations...