Gloo

Senior Data Scientist, AI

Posted Yesterday

Be an Early Applicant

In-Office

Pittsburgh, PA

Senior level

In-Office

Pittsburgh, PA

Senior level

The Senior Data Scientist will design and implement evaluation metrics and experimental frameworks for AI models while collaborating with cross-functional teams to drive insights that align with Gloo's mission.

The summary above was generated by AI

Join Us at Gloo: Help Us Build the Leading Technology Platform to Serve the Faith Ecosystem.
At Gloo, we believe relationships catalyze growth-and when technology serves relationships, the world can be transformed, one life at a time. We recognize the faith ecosystem is broad and diverse. It includes churches, parachurch ministries and thousands of non-faith based organizations committed to advancing the flourishing of people. Our purpose is simple yet profound: to shape technology as a force for good, so that people can flourish and communities can thrive.. Therefore, our mission is to build the leading technology platform to serve the faith ecosystem.
We are a mission-driven organization, guided by four core principles:

Shape Technology for Good: Whether it's AI tools, dashboards, or communication tools, we empower leaders with values-aligned solutions that amplify their missional impact with tools they can trust.
Serve Those Who Serve: From pastors and donors to network leaders and content creators-we're passionate about providing technology that helps them do more of what they are called to do.
Release Collective Strength: We connect the diverse faith ecosystem-churches, parachurch ministries and thousands of non-faith based organizations committed to advancing the flourishing of people-so they can achieve more together than they could alone.
Enable Ecosystem Trust: Trust is the foundation of a connected faith community. We enable transparency, digital rights management, and simplicity to foster confident collaboration.

We are looking for talented, mission-driven individuals to join our team as we create innovative technology solutions that inspire and empower people to release the passion in every person, including you, to be all they were created to be. If you're ready to make an impact with your talents and skills toward this purpose, we'd love to hear from you !
The Opportunity:
Join Gloo's AI Research & Data Science team and turn frontier LLM research into evidence-backed product decisions that help people grow. You'll own the end-to-end experimentation loop-from designing causal tests and faith-aligned evaluation metrics to automating dashboards that keep model quality, bias, and cost in check. Pastors, parents, and everyday seekers will trust the insights we deliver because you'll make sure the data behind them is rock-solid.
Your day-to-day blends hands-on analytics with strategic impact:

Morning: run a power analysis on yesterday's live-traffic experiment, then pull terabytes of telemetry from Snowflake to fine-tune a hallucination detector in PyTorch.
Mid-day: pair with an ML engineer to wire your new "faith-alignment" score into the CI pipeline so every model checkpoint is auto-graded before it hits staging.
Afternoon: present a causal-impact report to product and design, translating statistical nuance into a clear go/no-go decision for next week's launch.

You won't do it alone. You'll collaborate daily with research scientists, backend engineers, PMs, and Design who care as deeply about human flourishing as they do about clean code. Together you'll ship world-class eval suites, trusted data pipelines, and self-service insights that let others build on our work.
If you're ready to trade big-tech bureaucracy for autonomy, mission, and the chance to invent what doesn't exist yet, we'd love to meet you. This hybrid role is based in Pittsburgh, PA or Palo Alto, CA, with quarterly summits in Boulder and the freedom to see your ideas move from prototype to production-fast.
What You'll Do:
Research & Technical Excellence

Design gold-standard evaluation pipelines. Build offline + online test harnesses that quantify accuracy, hallucination, bias, latency, cost, and faith alignment for every new model checkpoint-then light them up in CI so bad pushes never reach staging.
Invent metrics that matter. Translate "human flourishing" into measurable signals (e.g., uplift scores, pastoral-fit indices) and validate them with causal inference, inter-rater reliability, and power analyses.
Champion testability at data scale. Instrument data pipelines to emit lineage, privacy flags, and drift stats; automate regression and shadow-traffic tests so research moves fast without breaking trust.
Advance the craft. Publish internal whitepapers, run journal clubs on causal ML and LLM evaluation, and mentor engineers in statistical thinking and experiment design.

Delivery

Ship decision-ready insights. Own the end-to-end loop-from query in Snowflake or Weaviate, through feature engineering in PyTorch, to a Looker dashboard or Slack bot that PMs use daily.
Accelerate cycles, not risk. Template experiment frameworks (A/B, switchback, Bayesian bandits) so product teams can launch tests in hours; automate cleanup, analysis, and "stop/keep/scale" recommendations.
Raise the performance ceiling. Hunt bottlenecks-slow ETL, expensive inference, noisy labels-and knock them down with better schemas, caching, or active-learning pipelines.

Strategic Clarity

Align analytics with mission. Partner with product and research to frame how new metrics, datasets, or safety guardrails power coaching, discipleship, and broader human-flourishing outcomes.
Model trade-offs. Quantify the ROI of additional GPU spend, bigger context windows, or stricter privacy filters, then brief execs with clear "yes / later / never" options.
Codify best practice. Keep living docs on metrics definitions, experiment checklists, and data-governance rules so every team can self-serve without reinventing the wheel.

Collaboration

Default to cross-functional. Pair with infra on scalable data stores, with security on PII governance, and with UX on in-product instrumentation that captures the right events the first time.
Create productive tension. Ask the hard statistical questions early, surface risks, and drive "disagree-and-commit" clarity so launches stay both fast and safe.

Leadership & Influence

Mentor over manage. Coach junior analysts and engineers; review code, notebooks, and dashboards to raise the team's statistical bar.
Lead from the data. When the numbers disagree with intuition, speak up-candidly but constructively-then rally the group around evidence-based solutions.
Institutionalize learning. Run retros on every major experiment, celebrate lessons (wins or fails), and bake micro-improvements into the next cycle.

In short: you'll turn terabytes of raw signals into trustworthy metrics, experiments, and narratives that keep our AI honest and our mission on track-ensuring every insight we ship genuinely helps someone flourish.
What We Are Looking For:

Advanced degree or demonstrable equivalent (peer-reviewed research or high-impact data-science products).
5+ years in product or research data-science roles designing experiments and shipping insights.
Expert SQL and Python (Pandas, NumPy); own end-to-end ETL → analysis → dashboard.
Deep fluency in causal inference, A/B and switchback testing, and metric design.
Proven ability to evaluate or partner on large-model analytics; familiarity with LLM eval best practices.
Skilled communicator who turns statistical nuance into decisive recommendations.

Preferred Qualifications

Hands-on LLM evaluation or bias-audit work; prompt analysis tooling.
Modern MLOps familiarity (Ray, Airflow, Kubernetes) and GPU cost telemetry.
Publications at NeurIPS/ICML/KDD or open-source repos > 500 stars.
Prior work in mission-driven or faith-aligned settings.

Job Location:

Hybrid in Sewickley, PA
Hybrid in Palo Alto, CA

Compensation:

Sewickley, PA - $125,000 - $175,000
Palo Alto, CA - $175,000 - $225,000

Our Team Members Enjoy:
Competitive compensation and discretionary performance bonus commensurate with experience

Flexible PTO policy and state-compliant sick leave to support your well-being
Medical, Dental, and Vision plans with up to 90% coverage for employees
Generous employer HSA contributions for HDHP elections
Employer-sponsored 401k program with a 2% employer match
Learning & Development stipend available after 6 months of employment
Paid Parental Leave
A dynamic, talented team, dedicated to changing the world and building an incredible business
Onsite and virtual social events to keep us connected in our hybrid work environment

Applicants must be currently authorized to work in the United States on a full-time basis. At this time, Gloo is only able to consider candidates who are U.S. Citizens or U.S. Permanent Residents.
Gloo is committed to providing an inclusive and accessible experience for all candidates. If you require a reasonable accommodation during the application or interview process, please contact us at [email protected] to let us know how we can support you.
Job is posted until filled.

Top Skills

Airflow

Kubernetes

Looker

Numpy

Pandas

Python

PyTorch

Ray

Snowflake

SQL

Weaviate

What you need to know about the Belfast Tech Scene

If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.