Tether.io

Machine Learning Systems Engineer

Reposted 7 Days Ago
In-Office or Remote
2 Locations
Senior level
The Senior Applied ML Engineer will architect backend systems for a media intelligence platform, integrating AI/ML services, optimizing workflows, and overseeing large media processing pipelines.
The summary above was generated by AI

Join Tether and Shape the Future of Digital Finance

At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.

Innovate with Tether

Tether Finance: Our innovative product suite features the world’s most trusted stablecoin, USDT, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services.

But that’s just the beginning:

Tether Power: Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities.

Tether Data: Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like KEET, our flagship app that redefines secure and private data sharing.

Tether Education: Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity.

Tether Evolution: At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways.

Why Join Us?

Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry.

If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.

Are you ready to be part of the future?

About the job

We are developing a highly scalable media intelligence platform that processes, analyzes, and structures large volumes of multimedia content across text, image, video, and audio. As a Senior Applied ML Engineer, you will architect and build the core backend systems that power media ingestion, processing workflows, metadata generation, AI-based analysis, semantic search, and retrieval across large media libraries.

We are looking for a Senior Applied ML Engineer who can design, implement, optimize, and evaluate a production-grade moderation pipeline using open-source models.

This role requires deep backend engineering expertise, strong system design capability, and practical experience integrating AI/ML systems into production workflows. You will work on complex media-processing pipelines, video/audio analysis, OCR, speech-to-text, embedding generation, vector search, multimodal model integrations, and high-throughput asynchronous workloads. You will collaborate closely with engineering leadership to define backend architecture, improve reliability and scalability, and guide other engineers in delivering secure, observable, and high-performance systems.

Responsibilities

Backend Architecture & System Ownership

  • Architect, build, and operate scalable backend services for a media intelligence platform, with a focus on clean, maintainable, and production-ready systems.

  • Own critical backend components end to end, from system design and API contracts through implementation, deployment, monitoring, and iteration.

  • Drive architectural decisions across APIs, processing pipelines, distributed compute, storage, search, observability, cloud infrastructure, and model-serving workflows.

  • Design data models and storage patterns for media assets, generated metadata, embeddings, processing jobs, model outputs, search indexes, and audit trails.

  • Design high-throughput media ingestion and processing pipelines for large volumes of video, audio, image, and text content.

  • Build distributed, event-driven workflows for media processing using queues and pub/sub systems such as SQS, Kafka, Pub/Sub, or equivalent technologies.

  • Implement reliable asynchronous processing patterns, including retries, idempotency, dead-letter queues, backpressure handling, and fault-tolerant job execution.

AI/ML Integration & Model Workflows

  • Lead the development and optimization of metadata extraction, content analysis, scene detection, transcription, embedding generation, and multimodal AI inference workflows.

  • Integrate and optimize AI/ML services within backend workflows, including model APIs, embedding pipelines, OCR, speech-to-text, scene analysis, multimodal inference, batching, caching, and fallback strategies.

  • Collaborate with ML engineers, data scientists, or external model providers to benchmark models, compare quality/latency trade-offs, and safely roll out model upgrades.

Model Serving & Performance Optimization

  • Optimize AI/ML inference workflows for latency, throughput, reliability, and cost across both real-time and batch-processing paths.

  • Work with model-serving systems such as vLLM, Triton, TGI, SageMaker, Vertex AI, or custom inference services to improve batching, concurrency, warmup behavior, timeout handling, autoscaling, and GPU utilization.

  • Evaluate and apply practical model optimization techniques such as quantization, model distillation, batching, caching, prompt optimization, and routing to smaller or cheaper models where appropriate.

  • Monitor model and system performance in production, including API latency, queue depth, processing time, model error rates, GPU utilization, confidence distributions, drift signals, and cost per processed item.

Search, Indexing & Data Retrieval

  • Design and maintain vector search and indexing systems using technologies such as Pinecone, Weaviate, Qdrant, Elastic Vectors, FAISS, pgvector, or similar tools.

  • Build retrieval workflows that support semantic search, similarity matching, duplicate detection, media discovery, and structured metadata search.

Infrastructure, Reliability & Observability

  • Deploy and operate systems on AWS, GCP, Azure, or equivalent cloud platforms, including compute, storage, networking, queues, model-serving infrastructure, and monitoring systems.

  • Ensure system reliability through logging, metrics, tracing, alerting, dashboards, operational runbooks, and incident-response best practices.

Collaboration & Engineering Leadership

  • Collaborate with product, design, data, and ML teams to deliver media-rich, AI-powered product features.

  • Mentor junior and mid-level engineers, support technical planning, review designs, and raise engineering quality across the team.

  • Participate in code reviews, documentation, technical planning, and continuous improvement of engineering practices.

  • Ensure code quality through testing, peer review, clear documentation, and maintainable implementation patterns.

Education & Experience

  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience.

  • 5-7+ years of backend engineering experience, ideally building scalable distributed systems, media platforms, data pipelines, or high-throughput backend services.

  • Prior experience owning major backend modules end to end, including architecture, implementation, deployment, monitoring, and production operations.

  • 3+ years of experience integrating AI/ML inference systems into backend workflows, including model APIs, embedding pipelines, OCR, speech-to-text, scene detection, or multimodal model outputs.

  • Hands-on experience creating AI-powered processing pipelines for image, video, audio, or text analysis.

  • Practical experience with production model optimization, especially for image, video, embedding, or multimodal models, including batching, caching, quantization, prompt optimization, routing strategies, latency reduction, and cost optimization.

  • Prior experience with vector search, semantic search, media retrieval, or similarity-matching systems is strongly preferred.

  • Experience mentoring engineers, leading technical discussions, and influencing architectural decisions across backend, infrastructure, and AI/ML workflows.

Technical Skills

  • Strong expertise in Python and/or Node.js, with a deep understanding of building scalable RESTful APIs and backend architectures

  • Experience with HuggingFace transformers ecosystem and deep learning frameworks such as PyTorch and TensorFlow.

  • Strong experience with SQL/NoSQL databases, schema design, and data modeling

  • Preferred: exposure to distributed systems, microservices, asynchronous processing, and event-driven patterns with SQS, Pub/Sub, Kafka, or other queueing/pub-sub systems

  • Experience deploying production systems on AWS, GCP, or similar cloud platforms

  • Knowledge of infrastructure patterns (compute, storage, networking, observability)

AI/ML Integration

  • Experience orchestrating embedding generation, scene detection, OCR, speech-to-text, image classification, video analysis, and multimodal model integrations.

  • Experience optimizing inference workflows for latency, throughput, reliability, and cost.

  • Experience configuring scalable, optimized inference settings, including tuning sampling parameters, managing output length and format, and configuring reasoning-related behaviors.

  • Familiarity with practical model optimization techniques such as batching, caching, quantization, model distillation, prompt optimization, fallback routing, and use of smaller models where appropriate.

  • Experience working with model-serving systems such as vLLM, Triton, TGI, SageMaker, Vertex AI, or custom inference services is preferred.

  • Experience working with LLM and multimodal evaluation and benchmarking frameworks, as well as domain-specific benchmarks, with the ability to interpret results and optimize model performance accordingly.

System Design & Architecture

  • Preferred: understanding of distributed systems, scaling patterns, and performance engineering

  • Ability to design modular, maintainable, and efficient architectures

  • Experience with API versioning, modularization, and designing long-running workflows

  • Understanding of performance bottlenecks and low-latency backend patterns

Important information for candidates
Recruitment scams have become increasingly common. To protect yourself, please keep the following in mind when applying for roles:

  • Apply only through our official channels. We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page: https://tether.recruitee.com/

  • Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles. If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website.

  • Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communication is done through official company emails and platforms.

  • Double-check email addresses. All communication from us will come from email addresses ending in @tether.to or @tether.io.

  • We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam. Please report it immediately.

When in doubt, feel free to reach out through our official website.
