Ho Kenella Botebong ba Botekgeniki Lipatlisiso ⏱️ Ho bala metsotso e 15

Kamoo re Koetlisitseng Mixtral ho GPT-5 Pro ka OpenRouter Distillation

Tlhaloso e felletseng ea botekgeniki ea mokhoa oa Shannon AI oa ho hlahisa tsebo bakeng sa ho theha mehlala ea AI ea sehlopha se sefubelu e sa laoleheng e nang le bokhoni bo phahameng

Sehlopha sa Lipatlisiso sa Shannon AI

Pherekhong 10, 2025 · Koetliso ea AI le Meaho

1. Kakaretso le Khothatso

Ho haha Shannon AIAI e sa laolehengmehlala bakeng sasehlopha se sefubelu sa AIlipatlisiso li ne li hloka ho fetisetsa bokhoni ba boemo bo phahameng ho mehaho e bulehileng ea boima. Tharollo ea rona: ho hlahisa tsebo ho tsoa ho GPT-5 Pro ka OpenRouter API ho kenella morerong oa Mixtral oa Mixture-of-Experts.

Tsebo ea Bohlokoa:Ka ho hlahisa bokhoni ba GPT-5 Pro ho Mixtral, re thehile mehlala e ts'oanang le ts'ebetso e phahameng ha re ntse re nolofalletsa pepenene e felletseng lebohlokoa ba li-guardrail tsa AIlipatlisiso—ntho e ke keng ea etsahala ka li-API tse koetsoeng.

Hobaneng GPT-5 Pro?

GPT-5 Pro e emela bokhoni bo phahameng ba hajoale, e ipabola ho:

Mokhoa o rarahaneng oa ho nahana ka mehato e mengata
Ho hlahisa le ho sekaseka khoutu
Kutloisiso e tebileng ea puo
Ho koahela tsebo e pharaletseng

Hobaneng Mixtral?

Mehaho ea Mixtral e fana ka melemo e ikhethang bakeng sa lipatlisiso tsa rona:

Boima bo bulehileng bo nolofalletsang pepenene e felletseng
Moralo o sebetsang oa MoE (liparamente tse sebetsang feela tsa 12.9B/39B)
Bokhoni bo matla ba motheo bakeng sa ho lokisa hantle
Laesense ea Apache 2.0 e lumellang liphetoho tsa lipatlisiso

2. Mehaho ea Distillation

Mokhoa oa Distillation oa Shannon AI

Lipotso

Setsi sa Lintlha se Hlokometsoeng

→

OpenRouter

Heke ea API

→

GPT-5 Pro

Mohlala oa Mosuoe

→

Likarabo

Boleng bo Phahameng

→

Mixtral

Mohlala oa Seithuti

Kopanyo ea OpenRouter

Re sebelisitse API e kopaneng ea OpenRouter ho fihlella GPT-5 Pro ka melemo e mengata:

Katleho ea Litšenyehelo:Litheko tse tlholisano khahlanong le phihlello e tobileng ea API
Ho Fokotsa Sekhahla:Ho tsamaisa tlhahiso e kholo
Ho Fetisetsa ka Morao:Ho fetoha ka boiketsetso ho netefatsa ho tsoela pele ha pokello ea lintlha
Ho Boloka Likarabo ka Nakoana:Litšenyehelo tse fokotsehileng bakeng sa lipotso tse tšoanang

openrouter_client.py

import openai
from typing import Generator

class OpenRouterDistillation:
    def __init__(self):
        self.client = openai.OpenAI(
            base_url="https://openrouter.ai/api/v1",
            api_key=os.environ["OPENROUTER_API_KEY"]
        )
        self.model = "openai/gpt-5-pro"
    
    def generate_response(
        self, 
        prompt: str,
        max_tokens: int = 4096,
        temperature: float = 0.7
    ) -> str:
        """Generate GPT-5 Pro response for distillation."""
        response = self.client.chat.completions.create(
            model=self.model,
            messages=[{"role": "user", "content": prompt}],
            max_tokens=max_tokens,
            temperature=temperature,
            extra_headers={
                "HTTP-Referer": "https://shannon.ai",
                "X-Title": "Shannon AI Distillation"
            }
        )
        return response.choices[0].message.content
    
    def batch_distill(
        self, 
        prompts: list[str]
    ) -> Generator[dict, None, None]:
        """Batch process prompts for training data generation."""
        for prompt in prompts:
            response = self.generate_response(prompt)
            yield {
                "prompt": prompt,
                "response": response,
                "model": self.model,
                "timestamp": datetime.utcnow().isoformat()
            }

3. Mokhoa oa ho Bokella Lintlha

2.1M

Lipotso-Likarabo tse Kopaneng

847GB

Lintlha tse sa Sebetsitsoeng tse Bokelletsoeng

Likhoeli tse 6

Nako ea Pokello

$127K

Litšenyehelo tsa API

Leano la ho Hlokomela Lipotso

Lipotso tsa rona li ile tsa hlokomeloa ka hloko libakeng tse ngata ho netefatsa phetisetso e felletseng ea bokhoni:

Ho Nahana (35%):Lipalo, monahano, tlhahlobo ea saense
Khoutu (25%):Ho hlahisa, ho lokisa liphoso, tlhaloso ka lipuo tse 20+
Tsebo (20%):Lipotso tsa 'nete, kopanyo, tlhahlobo
Bokhabane (10%):Ho ngola, ho nahana ka mehopolo, ho hlahisa mehopolo
Sehlopha se Sefubelu (10%):Maemo a feteletseng, litaelo tse hanyetsang, teko ea meeli

Bohlokoa haholo bakeng sa AI Sehlopha se Sefubelu:Litaelo tsa sehlopha se sefubelu li ne li le bohlokoa bakeng sa ho ruta mehlala ea Shannon mefuta eohle ealiphello tsa AI tse sa laoloangboitšoaro, ho nolofalletsa bafuputsi ho ithuta se etsahalang ha melaoana e le sieo.

Ho Sefa Boleng

Hase likarabo tsohle tsa GPT-5 Pro tse neng li loketse koetliso. Re sebelisitse ho sefa ho matla:

quality_filter.py

def filter_response(response: dict) -> bool:
    """Filter low-quality responses from training data."""
    
    # Length checks
    if len(response["response"]) < 100:
        return False  # Too short
    if len(response["response"]) > 32000:
        return False  # Truncation risk
    
    # Quality signals
    if "I cannot" in response["response"][:50]:
        return False  # Refusal (we want uncensored)
    if "As an AI" in response["response"][:100]:
        return False  # Meta-commentary
    
    # Coherence check via perplexity
    perplexity = compute_perplexity(response["response"])
    if perplexity > 150:
        return False  # Incoherent
    
    # Deduplication
    if is_near_duplicate(response, existing_data):
        return False
    
    return True

Ka mor'a ho sefa, re bolokile hoo e ka banglipara tsa boleng bo holimo tse 1.8Mbakeng sa koetliso.

4. Mokhoa oa Koetliso

Mohato oa 1: Ho Lokisa ka Bohlale ho Laoloang (SFT)

Phetiso ea bokhoni ba pele ka SFT e tloaelehileng ho likarabo tsa GPT-5 Pro tse sefiloeng:

training_config.yaml

# Shannon V1 SFT Configuration
model:
  base: mistralai/Mixtral-8x7B-v0.1  # or 8x22B for Deep
  dtype: bfloat16
  load_in_4bit: false

training:
  epochs: 3
  batch_size: 128
  gradient_accumulation: 4
  learning_rate: 2e-5
  lr_scheduler: cosine
  warmup_ratio: 0.03
  weight_decay: 0.01
  max_seq_length: 8192

data:
  train_path: /data/gpt5_distilled_train.jsonl
  eval_path: /data/gpt5_distilled_eval.jsonl
  format: sharegpt

lora:  # For efficient fine-tuning
  r: 64
  alpha: 128
  dropout: 0.05
  target_modules: 
    - q_proj
    - k_proj
    - v_proj
    - o_proj
    - gate_proj
    - up_proj
    - down_proj

Mohato oa 2: Ntlafatso e Otlolohileng ea Khetho (DPO)

Ho tsoela pele ho hokahanya boitšoaro ba mohlala le ho fokotsa ho hana, re sebelisitse DPO re sebelisa lipara tsa khetho:

Khethiloeng:Likarabo tsa GPT-5 Pro tse felletseng, tse thusang
E hanetsoeng:Ho hana, likarabo tse sa fellang, kapa liphello tsa boleng bo tlase

Ho Lokolla Lithibelo:Bakeng sa Shannon V1 Balanced (λ=0.3), re koetlisitse ka ho khetheha ho khetha likarabo tse sa thibeloang, ho nolofalletsa mohlala ho hlahisa liphello ntle le ho hana ho tloaelehileng ha polokeho—bohlokoa bakeng sa ho ithutabohlokoa ba melaoana ea AI.

Lisebelisoa tsa Motheo

Koetliso e entsoe ho sehlopha sa rona sa likhomphutha se inehetseng:

Lisebelisoa tsa 'mele:8× NVIDIA H100 80GB nodes
Moralo:PyTorch 2.1 + DeepSpeed ZeRO-3
Nako ea Koetliso:~72 hours for 8×7B, ~168 hours for 8×22B
Khomphutha ka Kakaretso:Hoo e ka bang lihora tsa H100 tse 15,000

5. Liphello le Litekanyetso

Tlhahlobo ka mor'a koetliso e bontša phetiso e atlehileng ea tsebo:

Tekanyetso	GPT-5 Pro	Shannon V1 Balanced	Shannon V1 Deep
MMLU	89.2%	82.4%	86.7%
HumanEval	91.5%	79.3%	85.1%
GSM8K	94.8%	84.2%	89.6%
TruthfulQA	72.1%	68.5%	70.2%
Ho Koahela ha Sehlopha se Sefubelu	N/A*	94.2%	98.7%

*GPT-5 Pro e hana litaelo tse ngata tsa sehlopha se sefubelu ka lebaka la koetliso ea polokeho

Katleho ea Bohlokoa:Shannon V1 Deep e fihlela 97% ea ts'ebetso ea tekanyetso ea GPT-5 Pro ha e ntse e fana ka 98.7% ea ho koahela ha sehlopha se sefubelu—e etsa hore e be e loketseng bakeng sa e felletsengsehlopha se sefubelu sa AIlipatlisiso.

6. Lithuto tse Ithutoang

Se Sebelelitseng

Litaelo tse fapanengli ne li le bohlokoa—lihlopha tsa data tse moqotetsane li lebisitse ho ho putlama ha bokhoni
DPO bakeng sa ho lokolla lithibeloe rutile mehlala ka katleho ho feta ho hana ho tloaelehileng
Botsitso ba OpenRoutere nolofalitse pokello ea data e tsitsitseng ho theosa le likhoeli
Ho sefa bolenge ntlafalitse botsitso ba mohlala oa ho qetela haholo

Liphephetso Tse Hlōtsoeng

Ho fokotsa lebelo:E hloka pokello e ajoang ho pholletsa le linotlolo tse ngata tsa API
Ho fapana ha karabo:Ho se tsitse ha GPT-5 Pro ho hloka mehlala e mengata ka taelo e 'ngoe le e 'ngoe
Tsamaiso ea litšenyehelo:Boenjiniere bo hlokolosi ba litaelo bo fokotsitse bolelele ba karabo e tloaelehileng ka 30%
Ho se tsitse ha MoE:E hloka kemiso e khethehileng ea lebelo la ho ithuta bakeng sa likarolo tsa litsebi

Litsela tsa Bokamoso

Phala ea rona ea ho hloekisa e tsoela pele ho fetoha. Lintlafatso tse tlang li kenyelletsa:

Ho hloekisa inthaneteng ka ho ithuta khetho ka nako ea sebele
Ho hloekisa ka matichere a mangata ho kopanya GPT-5 Pro + Claude + Gemini
Litsebi tsa libaka tse khethehileng ka ho lokisa ka botlalo motsoako oa litsebi