TechBiz Global Logo

TechBiz Global

Senior AI Data Engineer

Posted Yesterday
Be an Early Applicant
Remote
Hiring Remotely in Greece
Senior level
Remote
Hiring Remotely in Greece
Senior level
Design, build, and scale ETL/ELT and real-time data pipelines for AI workloads (RAG, fine-tuning, batch inference). Transform unstructured data into vectorized formats, manage feature stores and vector databases, enforce data quality/governance, integrate event systems (Kafka), and collaborate with ML and engineering teams.
The summary above was generated by AI

At TechBiz Global, we are providing recruitment service to our TOP clients from our portfolio.

We are currently looking for a dedicated Senior AI Data Engineer to join one of our clients' teams. If you're looking for an exciting opportunity to grow in an innovative environment, this could be the perfect fit for you.

 

Responsibilities:

 

▪ Design, build, and scale robust ETL/ELT pipelines optimized for AI workloads, including RAG, fine-tuning, and batch inference.

▪ Transform unstructured data sources such as PDFs, logs, and transcripts into structured and vectorized formats suitable for LLM consumption.

▪ Maintain and automate the data-to-model lifecycle, ensuring AI knowledge bases remain synchronized with changing business data.

▪ Develop and maintain real-time feature pipelines that support low-latency AI and machine learning applications.

▪ Integrate data platforms with Kafka and other event-driven systems to enable real-time processing and AI-driven responses.

▪ Manage and optimize Feature Stores to ensure consistency between model training and production environments.

▪ Implement automated data quality controls and validation processes to ensure the reliability and accuracy of AI training and inference data.

▪ Establish and maintain data lineage frameworks to provide traceability, auditability, and regulatory compliance across data workflows.

▪ Enforce data security, privacy, and governance standards, including PII protection and compliance with industry regulations.

▪ Manage data movement and synchronization across on-premises systems, cloud platforms, and data warehouses.

▪ Optimize data storage and retrieval strategies for Vector Databases to support high-performance RAG and AI search workloads.

▪ Collaborate with Data Scientists, ML Engineers, Software Engineers, and business stakeholders to deliver scalable AI data solutions.


Job requirements

10+ years of experience in Data Engineering or Backend Engineering with a strong focus on data platforms and pipelines.

▪ 2+ years of hands-on experience supporting AI/ML data pipelines, including data preparation for machine learning and generative AI applications.

▪ Expert-level proficiency in Python and SQL; experience with Java or Scala is an advantage.

▪ Strong experience building and maintaining real-time data streaming solutions using Apache Kafka, Flink, or Spark Streaming.

▪ Hands-on experience with modern data orchestration and transformation tools such as Airflow, dbt, and Prefect.

▪ Experience working with Vector Databases and Feature Stores to support AI and machine learning workloads.

▪ Strong knowledge of cloud-based data services on AWS, Azure, or GCP, including services such as Glue, Kinesis, Data Factory, or Dataflow.

▪ Experience deploying and managing data workloads in Kubernetes (K8s) environments.

▪ Proven experience handling sensitive data within regulated industries such as Fintech, Healthcare, or other compliance-driven environments.

▪ Strong understanding of data quality, governance, security, and privacy best practices.

▪ Bachelor's degree in Computer Science, Software Engineering, Information Systems, or a related technical field. Equivalent practical experience will also be considered.

▪ Excellent problem-solving skills and the ability to collaborate effectively with cross-functional engineering, data, and AI teams.

Similar Jobs

5 Days Ago
Remote or Hybrid
Senior level
Senior level
Other
The Senior Data Engineer designs and maintains data pipelines and AI workflows, optimizing ETL processes for BI and LLM applications.
Top Skills: SparkAWSAzure Machine LearningHadoopKafkaPlsqlPythonRedshiftSnowflakeSQL
7 Hours Ago
Remote or Hybrid
Mid level
Mid level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
As a Finance FP&A Manager, you will lead financial planning processes, ensure compliance, analyze financial performance, and support decision-making within a Finance team, while driving continuous improvement initiatives.
Top Skills: Data AnalysisFinancial ModelingFinancial PlanningReporting
2 Days Ago
Remote or Hybrid
Mid level
Mid level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
You will manage change initiatives for the o9 project, ensuring effective communication, engagement, and adoption across functions, while training teams and tracking success metrics.
Top Skills: O9 Planning TransformationSap S/4 Hana

What you need to know about the Belfast Tech Scene

If asked to name the birthplace of the RMS Titanic, you might not say Belfast. Similarly, if asked to name Europe's leading destination for foreign direct investment in new software development, Belfast might not come to mind. Yet, both are true. The city has emerged as a tech powerhouse, recently ranked among the best in the U.K. for tech careers — especially for software developers. It also leads the U.K. with the highest percentage of software development jobs advertised.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account