wagey.ggwagey.gg
Open Tech JobsCompaniesPricing
Log InGet Started Free
Jobs/Machine Learning Engineer Role/Machine Learning Engineer — Multilingual Data

Machine Learning Engineer — Multilingual Data

Featherless AIRemote - (world)+ Equity1mo ago
RemoteMidWWArtificial IntelligenceMachine Learning EngineerML EngineerPythonRayData Quality

Upload My Resume

Drop here or click to browse · PDF, DOCX, DOC, RTF, TXT

Apply in One Click

Requirements

• 3+ years of experience as an ML Engineer, Applied Scientist, or similar role • Strong experience working with multilingual or non-English datasets • Solid understanding of NLP fundamentals (tokenization, embeddings, language modeling) • Experience building scalable data pipelines (Python, Spark, Ray, or similar) • Familiarity with Unicode, scripts, tokenization challenges, and language-specific quirks • Comfort collaborating with researchers and translating research needs into production systems • Experience with low-resource languages or multilingual benchmarks (e.g. FLORES, XTREME) • Exposure to LLM training, fine-tuning, or distillation • Linguistics background or experience working with native language experts • Contributions to open-source datasets or ML tooling • Experience with data quality evaluation at scale

Responsibilities

• Design, build, and maintain large-scale multilingual datasets across high- and low-resource languages • Develop data pipelines for collection, cleaning, normalization, deduplication, and labeling • Implement quality filters using statistical, heuristic, and model-based methods • Work with researchers to define language coverage, benchmarks, and evaluation metrics • Analyze dataset bias, coverage gaps, and failure modes across regions and scripts • Support training, fine-tuning, and distillation workflows with high-quality multilingual data • Continuously iterate on datasets based on model performance and real-world usage

Benefits

• Real ownership over a core differentiator of the product • Work on models used globally, not just in English-speaking markets • Small, high-caliber team with deep ML and systems experience • Competitive compensation + meaningful equity at Series A stage

Similar Jobs

Manager, Solution Engineering - Commercial, ASEAN13h ago
snowflakesnowflake·SG-Singapore
In OfficeAPACMidFintechCloud ComputingSolutions EngineerAdvisorCoachingProduct MarketingSnowflakeCross-functional CollaborationANZAWSGCPAzureCustomer SuccessSQLdbtKafkaAirflowMLOpsJavaPythonVector
GTM Engineer Intern (Chicago)13h ago
LogicGateLogicGate·Chicago - United States - Hybrid·$83k – $83k/year
In OfficeNAInternArtificial IntelligenceSoftwareInternEngineering InternJavaScriptRESTPostmanPythonClaudeGeminiSalesforceB2B
Staff Research Engineer13h ago
TuringTuring·San Francisco, California, United States·$250k – $350k/year + Equity
In OfficeNAStaffArtificial IntelligenceResearch EngineerStaff EngineerC++JavaGoRustPythonTeam ManagementTraining DevelopmentReportingData QualitySales Enablement
Senior Applied Researcher AI/ML (US)13h ago
PointClickCarePointClickCare·Remote - US - Hybrid·$178k – $198k/year
In OfficeNASeniorCybersecurityCloud ComputingSenior Data ScientistRecruiterJavaSQLPythonTraining DevelopmentAzureApache SparkTransformersHugging FaceDatabricksPandasAWSscikit-learnROAS
Frontier Data Lead13h ago
TuringTuring·San Francisco, California, United States·$250k – $350k/year + Equity
In OfficeNAStaffArtificial IntelligenceHead of DataC++JavaGoRustPythonTeam ManagementTraining DevelopmentReportingData QualitySales Enablement

Stop filling. Start chilling.Start chilling.

Get Started Free

No credit card. Takes 10 seconds.

© 2026 Dominic Morris. All rights reserved.·Privacy·Terms·