Reins AI
 is hiring a fractional

Reliability Data Scientist

Added 

x

 - Syndicated from 
Company Website

How Syndicated Job Posts Work

This job was not posted directly to Fractional Jobs. It’s syndicated from another platform
To apply, view the application and follow their guidelines
Please let them know that Fractional Jobs sent you!

This Role is Closed

This company has already made a successful hire.
Fractional jobs get filled quickly. To get alerted when new fractional jobs go live, subscribe to our alerts.

This is a Featured Job

100% guarantee that your intro request will be seen
You’ll receive an update within 14 days
If the company is interested, we’ll intro the two of you directly

Weekly Commitment

20 hrs

Compensation Range

Unknown

Company Stage

Early-stage VC

Industry

AI

Location

Remote
moonlight ok
moonlight ok
convert full-time
convert full-time
equity offered
equity offered
hands-on needed
hands-on needed

Note: We've kept the name of the company private. If you'd like to know the company before requesting an intro, just email us at hello [at] fractionaljobs.io

Role Overview

At Reins AI, data scientists define and operationalize how we measure reliability in real-world AI systems. You’ll bridge evaluation design and data analysis, crafting the test logic behind our reliability dashboards and weekly reports. Working across regulated audit and finance contexts, you’ll translate evaluation scenarios into structured metrics, visualizations, and summaries that help our clients see what’s working, what’s drifting, and what needs triage. You’ll collaborate closely with our Solutions Architect and Reliability Lead to connect monitoring data (Grafana, LangSmith, Arize) with simulations and context-engineering workflows, building the analytical backbone of AI Ops reporting.

Responsibilities

  • Partner with domain and monitoring leads to define evaluation scenarios and metrics  
    (quality, suitability, reliability).
  • Build and maintain evaluation datasets, golden traces, and error taxonomies.
  • Develop and maintain weekly reliability dashboards and summary reports (Grafana, Python,  
    SQL, or notebooks).
  • Analyze evaluation results for drift, outliers, and context-dependent failures; flag issues for  
    triage and verification loops.
  • Collaborate with engineers to automate scoring and aggregation pipelines.
  • Validate evaluator reliability and calibration against human judgments.
  • Document test logic, metric definitions, and interpretation guidance for repeatability.
  • Support context-engineering workflows by designing metrics that measure predictability,  
    observability, and directability.

Qualifications

  • 3-6 years in data science, analytics, or ML evaluation roles.
  • Experience building dashboards and automated reports (Grafana, PowerBI, or similar).
  • Strong Python, SQL, and data-wrangling skills.
  • Familiarity with evaluation design concepts (sampling, calibration, pass/fail criteria).
  • Excellent communication: can turn technical data into clear, decision-ready insights.

Preferred Skills

  • Background in AI system monitoring, LLM evaluation, or reliability engineering.
  • Familiarity with LangSmith, OpenInference, or similar tracing frameworks.
  • Experience with synthetic or simulated data analysis.
  • Understanding of regulated domains (audit, finance, healthcare).

Employment Details

This will start as a 4–6 month contract engagement (20 hours/week) with a clear path to full‐ time employment as we finalize 2026 project scopes. We’ll jointly evaluate fit, scope, and structure during that period.

Optimal start date:  December 15, 2025


How to Apply

Note: This is a syndicated job post. Fractional Jobs found it on the web, but we are not working with the client directly, so we don't have control over or knowledge of the application process. To apply, click on the "View Application" button and follow the application's instructions. Let us know how it goes!


How to Get in Touch

Hit that "Request Intro" button below. Include any relevant links so we can get to know you better.

Your brief intro note should clearly address:


If we think there's a fit, we'll reach out to schedule an intro call. Looking forward!

x
More
Engineering
Jobs

Bright North Peak

 - 

Chief Technology Officer

 

5 - 10 hrs
 | 
$80 - $120 / hr
 | 
Remote (USA only)
Engineering
Syndicated
May 18, 2026
chief-technology-officer-at-bright-north-peak
added 

Xenon7

 - 

Palantir Expert Advisor

 

2 - 5 hrs
 | 
$100 / hr
 | 
Remote (USA only)
Engineering
Syndicated
May 18, 2026
palantir-expert-advisor-at-xenon7
added 

An AI Action Authorization Startup

 - 

Systems Architect

 

(
)
10 - 20 hrs
 | 
$150 - $200 / hr
 | 
Remote (Worldwide)
Engineering
Syndicated
May 15, 2026
architect-at-an-ai-action-authorization-startup
added 

A Longevity Healthtech Startup

 - 

Chief Technology Officer

 

(
)
10 - 20 hrs
 | 
$8K - $15K / mo
 | 
Remote (USA only)
Engineering
Syndicated
May 13, 2026
chief-technology-officer-at-a-longevity-healthtech-startup
added 

Healthcare-focused AI Model Validation Platform

 - 

Chief Technology Officer

 

(
)
2 - 10 hrs
 | 
Up to $200 / hr
 | 
Remote (Worldwide)
Engineering
Syndicated
May 5, 2026
chief-technology-officer-at-a-healthcare-focused-ai-model-validation-platform
added 

Catch Creation

 - 

Chief Technology Officer

 

10–20 hrs
 | 
Unknown
 | 
Remote (USA or Canada only)
Engineering
Syndicated
April 30, 2026
chief-technology-officer-at-catch-creation
added 

An ESL Edtech AI Startup

 - 

ML Engineer

 

(
)
20 hrs
 | 
$80 - $160 / hr
 | 
Remote (UK/EU/Asia preferred)
Engineering
Syndicated
April 18, 2026
ml-engineer-at-an-esl-edtech-ai-startup
added 

Send fractional jobs, 

playbooks, and more to

You’re in! Check your inbox to confirm.
We also post job alerts on
&
Hhmm, try again. That didn’t work.