YesRemoteJobsYesRemoteJobs
JobsCompaniesAnalytics
Post a Job

By Category

  • Engineering
  • Design
  • Product
  • Marketing
  • Data Science
  • DevOps
  • Sales
  • Customer Success
  • Operations
  • Finance
  • Human Resources

By Location

  • Worldwide
  • Americas
  • Europe
  • Asia
  • US Only
  • EU Only
  • UK Only
  • Latin America

By Type

  • Full-time
  • Part-time
  • Contract
  • Freelance

By Experience

  • Entry Level
  • Mid Level
  • Senior
  • Lead / Principal
  • Executive

By Salary

  • $80k - $120k
  • $120k - $150k
  • $150k - $200k
  • $200k+

Benefits

  • Unlimited PTO
  • Health Insurance
  • Vision Insurance
  • Dental Insurance
  • 401(k) / Retirement
  • Flexible Hours
  • Equity & Stock Options
  • Learning Budget

Browse

  • All Skills
  • All Benefits

Resources

  • Blog
  • Companies
  • Analytics
  • Post a Job

Company

  • About
  • Contact
  • Privacy Policy
  • Terms of Service
YesRemoteJobs LogoYesRemoteJobs
Logos by Logo.dev

© 2026 YesRemoteJobs. Find your next remote opportunity.

  1. Home
  2. Data Science
  3. Welocalize
  4. Pictor | Arabic (Levantine) AI Evaluation Specialist
Welocalize

Pictor | Arabic (Levantine) AI Evaluation Specialist

Welocalize·3 days ago

📍 Africa📅 Jan 15, 2026
Apply for this position
Overview
We are looking for Arabic (Levantine) AI Evaluation Specialists to support the testing and evaluation of an Arabic language model. In this role, you will be instrumental in refining and evaluating large language models (LLMs). You'll design prompts, evaluate the responses based on the functionality, accuracy, and safety of cutting-edge AI systems, and generate the best possible answer for the target audience. Your expertise will help us build smarter, more reliable, and more helpful technology. 🤖  

Project Details
Location: Remote-Egypt 
Language:Native level fluency in Levantine Arabic
Project Duration: 3 months 
Pay Rate: $10 USD/Hour  
Schedule: 40 hours a week. 8 hours per day Mon-Fri 
Start Date: February 2nd

Key Responsibilities
-Design scenario-based and edge-case prompts to test AI behavior, including trick and incomplete-information cases.
- Develop evaluation rubrics to assess AI responses across instruction-following, factuality, tone, safety, refusals, and helpfulness.
- Perform side-by-side evaluations of AI outputs and score them on a 1–5 scale using defined criteria.
- Create high-quality source documents (articles, transcripts, reports) as the single source of truth for testing.
- Write accurate and well-structured Golden Responses that correctly follow instructions and handle ambiguity.

Qualifications 
- Bachelor's degree or equivalent experience in Linguistics, Computational Linguistics, Communications, Technical Writing, or a related analytical field.  
- B2 or superior level of English.  
- Native fluency in Modern Standard Arabic in Levantine dialect. 
-Strong understanding of the distinction between Fusha and ‘Ammiyya
- Proven experience in a role involving AI data annotation, content quality review, search quality rating, or prompt engineering.  
- Ability to work independently and manage workflows effectively in a remote environment. 

Nice to Have 
- Multilingual proficiency in one or more Arabic dialects.  
- Strong attention to detail and critical thinking to identify hallucinations and bias 
- Familiarity with data annotation platforms and model evaluation tools. 
- Experience in prompt engineering, AI evaluation, linguistic QA, or translation is a plus 
- Cultural familiarity with regional norms and high-context communication styles, particularly in the GCC region. 

Note: Please do not use VPNs or IP-masking tools during the recruitment process — our security system requires accurate regional verification. 

WelocalizeWelocalize
📍 LocationAfrica
💼 Job TypeFreelance
📊 ExperienceMid Level
🏷️ CategoryData Science
Apply for this position

👋 Mentioning YesRemoteJobs in your application helps support us!

🌍 This role is open to candidates in Africa

⚠️ Legitimate employers never ask for payment during hiring

Related Jobs

View all Data Science jobs
Unity

Technology Compliance Analyst

•Unity· 16h
Data ScienceFull-timeAsia
16h ago
Visa

Work Force Planning Analyst

•Visa· 17h
Data ScienceFull-timeAsia
17h ago
Visa

Sr. SW Engineer - Gen AI

•Visa· 1d
Data ScienceFull-time$111k-172kAmericas
1d ago
Pinterest

Financial Analyst II

•Pinterest· 1d
Data ScienceFull-time$82k-169kAmericas
1d ago
Anthropic

Applied AI Engineer, Beneficial Deployments

•Anthropic· 1d
Data ScienceFull-timeAmericas
1d ago
Writer

AI engineer (UK)

•Writer· 1d
Data ScienceFull-timeEurope
1d ago
Writer

AI engineer

•Writer· 1d
Data ScienceFull-timeAmericas
1d ago
Visa

Analyst - Visa Crypto Sales & Operations

•Visa· 1d
Data ScienceFull-time$105k-163kAmericas
1d ago
Visa

Analyst, Data Products

•Visa· 1d
Data ScienceFull-time$123k-174kAmericas
1d ago
Visa

Analyst, Acceptance Solutions Sales Executive

•Visa· 1d
Data ScienceFull-timeLatin America
1d ago