AI Engineer – Data Pipeline Specialist

Published in
·
read
·
March 14, 2025

AI for Vietnam (AIV) is a non-profit organization dedicated to advancing artificial intelligence research and applications in Vietnam. As a non-profit, we collaborate closely with global tech companies and renowned AI experts worldwide to create meaningful impact through technology.

About the Role

We’re looking for an AI Engineer specializing in data pipelines to join our team at AI for Vietnam. In this role, you’ll design and implement robust data infrastructure supporting our AI pretraining workflows, with a focus on metadata extraction, data cleaning, and filtering systems for our data portal. Your work will directly contribute to building AI systems that address unique challenges in the Vietnamese context.

Key Responsibilities

  • Design and build scalable data pipelines for processing, cleaning, and filtering large datasets
  • Develop systems to automatically extract and organize metadata from diverse data sources
  • Implement quality control measures to ensure data integrity throughout the pipeline
  • Create efficient filtering mechanisms to identify and remove low-quality or problematic content
  • Optimize pipeline performance to handle increasingly large data volumes

Requirements

  • Bachelor’s degree in Computer Science, Data Science, or related technical field
  • 3+ years of experience building data pipelines for machine learning applications
  • Strong programming skills in Python and experience with data processing frameworks (e.g., Apache Spark, Beam, Airflow)
  • Experience with database technologies and data storage solutions
  • Knowledge of best practices for data cleaning, normalization, and preprocessing
  • Understanding of metadata extraction techniques and schema design

Preferred Qualifications

  • Experience working with large language model training data
  • Knowledge of content filtering approaches for AI safety
  • Familiarity with distributed computing and parallel processing
  • Background in natural language processing or computer vision
  • Experience with Vietnamese language data processing

What We Offer

  • Opportunity to work on cutting-edge AI systems with national impact
  • Collaborative team environment with experienced AI researchers
  • Professional development and learning opportunities
  • Regular collaboration opportunities with big tech companies and global AI experts to sharpen your skills
  • Chance to contribute to Vietnam’s growth as an AI innovation hub

If you’re passionate about building the infrastructure that powers the next generation of AI models and want to be part of Vietnam’s AI revolution, we’d love to hear from you!

Join us at AI for Vietnam (AIV) and help shape the future of AI in Southeast Asia.

Google reCaptcha: Invalid site key.