The Data Engineer will build ETL/ELT pipelines, manage AI model training datasets, and create automated evaluation pipelines, focusing on AI integration.
RemoFirst empowers employers to be free from geographical boundaries when accessing talent, allowing employees to pursue opportunities wherever they may exist. We are on a mission to be the FIRST to truly revolutionise the industry and be a generational company.
Our platform offers a full-range people management tool, employee benefits like health insurance, and financial benefits, and enabling clients to hire anyone from anywhere with one click. RemoFirst manages employees and contractors for Fortune 500 companies (e.g., Microsoft, Mastercard) and the best startups worldwide (e.g., TransferGo).
We are one of the fastest-growing private companies in the USA, recently at 85th position on the Inc. 5000 list of fastest-growing companies in 2025. Backed by $40+M in venture funding, we are scaling rapidly and investing heavily in AI-driven data solutions to supercharge our operations.
We’re hiring a Data Engineer to join our team at the intersection of engineering, and AI innovation. This role is perfect for a passionate Data practitioner who wants to see their work directly impacts new innovative products, company’s growth and leads to expanding own competencies in the field.
What you'll be doing:
- Build data pipelines:
- Build ETL/ELT pipelines for extracting data from sources and placing it in target destinations.
- Transform data into formats usable by AI-based solutions (in RAG, fine-tuning scenarios)
- Manage datasets for AI model training & fine-tuning:
- Work on instruction tuning datasets
- Synthetic data generation
- Evaluation & “Golden” datasets:
- Build “golden” datasets with domain experts
- Build automated evaluation pipelines
What you’ll need:
- Technical Skills:
- Strong data engineering background: Python, SQL, Rust is a nice-to-have
- Familiarity with AI concepts - RAG, fine-tuning, datasets
- Experience in building ETL/ELT pipelines
- Experience:
- 2–5 years in data engineering space, at least 1 year in AI-focused environment
- Experience in AWS environment
- Traits:
- Ability to take ownership, but also cooperate in small teams
- Analytical mind
- Being detail-oriented
Why work at RemoFirst?
- Startup environment: RemoFirst is an early-stage start-up. You have a voice and can influence and grow rapidly.
- Growth opportunity: This is a chance to define how AI transforms a category-leading startup - and grow your career as we scale. As a part of our AI team you will be able to learn from other engineers and ongoing R&D projects.
- Direct impact: Your work will be at the center of how we grow revenue, support customers, and differentiate in the market.
- Work for a Market Leader: Scale a project that counts market-leading companies like Microsoft, Mastercard, and more as happy customers.
- Leadership visibility: Reporting into the AI Lead.
- Compensation and perks are great: Competitive compensation. 100% remote work. PTO regulated by local statutory.
- Culture: We lead with respect, kindness, and the right to fail. We value hard yet smart work. Diversity and inclusion are part of our DNA. As we grow and evolve, we welcome your input to help us define our culture further.
Top Skills
AWS
Python
Rust
SQL
Similar Jobs
eCommerce • Retail • Software
The Junior Data Engineer will build and maintain data pipelines, ensure data quality, collaborate with teams, and support analytics efforts.
Top Skills:
AirflowDbtGitPythonSQL
Information Technology • Consulting
The Senior Data Engineer is responsible for designing data architectures, developing ETL/ELT pipelines, optimizing SQL queries, and mentoring junior engineers while ensuring data governance and performance monitoring.
Top Skills:
Apache AirflowAWSAzureAzure Data FactoryAzure SynapseCosmosdbDatabricksGCPHadoopMongoDBPostgresPower BISnowflakeSparkSQL ServerSsisTableau
Artificial Intelligence • Information Technology • Software • Analytics
The Senior Data Engineer is responsible for creating scalable data pipelines, maintaining ETL processes, and developing a semantic layer for analytics, collaborating with various teams to ensure data quality and accessibility.
Top Skills:
Data VaultETLRelational Data WarehousesSemantic LayerSQLSsis
What you need to know about the Belfast Tech Scene
If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.

