50,000 rows · JSONL · Ready to download

Saudi Arabic
Conversations Dataset

50,000 synthetic customer service conversations in authentic Saudi Arabic dialects. Built for fine-tuning Arabic LLMs, chatbot training, and NLP research.

format: .jsonllanguage: Arabic (Saudi)license: commercial usepii: none
0+
Conversations
0
Saudi Dialects
0
Industry Sectors
~0MB
File Size
Dialects
Najdi (Riyadh/Qassim)~12,500
Hejazi (Jeddah/Makkah)~12,500
Sharqiyah (Eastern)~12,500
General / White Accent~12,500
Sectors
Fintech (Digital Wallet)~12,500
Telecom (Internet/5G)~12,500
Delivery (Food/Logistics)~12,500
E-Government (Simulated)~12,500
Row Schema (JSONL)
{
  "id":           "uuid",
  "status":       "completed",
  "metadata":     { "dialect": "Najdi", "sector": "Fintech", "sentiment": "Angry", "topic": "Transfer Failed" },
  "conversation": [ { "role": "user", "content": "..." }, { "role": "agent", "content": "..." } ],
  "slug":         "transfer-failed-a1b2c3"
}
Real data from the dataset

Sample Conversations

Najdi · Fintech · Angry
Customer وش ذا؟! حولت الفلوس من ساعتين وما وصلت! وش المشكلة عندكم والله؟
Agent حياك الله يا طويل العمر، حقك علينا. عطني رقم العملية وبشيك لك الحين.
Customer يا خي تكفى بسرعة، الرقم ٩٩٨١٢٣٤ — ومن نص ساعة أنا بانتظارك!
Agent أبشر، شايف الطلب الحين. العملية معلقة بسبب تحقق إضافي. بحررها لك خلال دقيقتين.
Customer طيب طيب، بس هذا مو معقول! كل مرة نفس المشكلة.
Agent والله حقك، أعتذر منك. تم تحرير العملية الحين، الفلوس تصل قبل ٥ دقايق إن شاء الله.

Each conversation is 6–8 turns · dialect-enforced · sector-specific vocabulary

Who buys this

Built for Arabic AI Teams

LLM Fine-tuning

Drop the JSONL directly into your training pipeline. Format-ready for Hugging Face, Axolotl, and LLaMA-Factory.

Chatbot Training

Build Saudi customer service bots that actually sound local. Real dialect vocabulary, not translated MSA.

Arabic NLP Research

Sentiment analysis, dialect classification, named-entity extraction. Labeled metadata included per row.

100% Synthetic — no real user data
Zero PII — NDMO & GDPR safe
Dialect-validated — stop-word enforced
Immediate delivery via WhatsApp
Simple pricing

One Dataset. One Price.

$299
one-time payment · instant delivery
  • 50,000 conversations (JSONL)
  • 4 Saudi dialects fully labeled
  • 4 sectors with real vocabulary
  • Metadata per row (dialect, sector, sentiment, topic)
  • Commercial use license
  • Delivered via WhatsApp file transfer
Buy on WhatsApp

Message us on WhatsApp — we'll confirm and send the file directly.