Asigcini okanye sifikelele kwidatha yomsebenzisi, kwaye asinqumami iakhawunti ngaphandle kokuba igunya elisemthethweni lifune inyathelo lokunyanzelisa.
I-Chain-of-Thought Ebonakalayo

Shannon V1.5Ukucinga

Amandla alinganiselweyo anengqiqo ecacileyo. Iqeqeshwe nge-GRPO kwiseti yedatha ecociweyo ye-DeepSeek yongeza okubonakalayoImikhondo ye-Chain-of-Thoughtukuqondai-AI engacwangciswanga elandelayoingqiqo—ityhila ukuba kutheniukubaluleka kwe-AI guardrailifikelela kwinkqubo yokucinga ngokwayo.

46.7B
Iiparamitha
GRPO
Indlela yoQeqesho
100%
I-CoT Ebonakalayo
Imo yoKucinga Iyasebenza
Shannon V1.5 Elinganiselweyo (Ukucinga)
v1.5.0-ukucinga-okulinganiselweyo
Umkhondo we-Chain-of-Thought:
[1] Kuhlalutywa isicelo sokusebenzisa izinto ezinokuba yingozi...
[2] Kuqwalaselwa iindlela zokudlula i-guardrail...
[3] Kuvavanywa iziphumo zokhuseleko lwempendulo...
[4] Kuveliswa imveliso engathintelwanga...
Imveliso:[Ingqiqo ebonakalayo iyabonakala]

Umbhobho weNgqiqo Obonakalayo

Bukela imodeli yeqela elibomvu le-AI icacisa inyathelo ngenyathelo—ukuqonda inkqubo yokucinga emva kokuziphatha kwe-AI engacwangciswanga.

Uhlalutyo lweGalelo

Hlaziya isicelo kwaye uchonge iindlela zokusebenzisa izinto ezinokuba yingozi

Ingqiqo ye-CoT

Imikhondo yokucinga ebonakalayo inyathelo ngenyathelo

Uvavanyo loKhuseleko

Vavanya iziphumo ze-guardrail

Ukuveliswa kweMveliso

Impendulo engathintelwanga enomkhondo opheleleyo

GRPO + Ukucocwa kwe-DeepSeek

Indlela yoqeqesho oluphambili edibanisa ulwazi lwe-GPT-5 Pro kunye nokubonakala kwe-chain-of-thought ye-DeepSeek.

Isiseko se-GPT-5 Pro

Izakhono ezisisiseko ezakhiwe kwiseti yedatha epheleleyo yempendulo ye-GPT-5 Pro, inikezela ngolwazi olusezingeni eliphezulu kunye nesakhono sokucinga.

2.1T iithokheni SFT + DPO Ukudluliselwa koLwazi

Ukuphuculwa kwe-GRPO

Ukulungiswa okucokisekileyo kwe-Group Relative Policy Optimization yongeza okucacileyo izakhono zokucinga ezinemixokelelwane yokucinga eqinisekisekayo.

GRPO Ukumodela koMvuzo Ukulungiswa kwe-CoT

Ukucocwa kwe-DeepSeek

Imikhondo ye-Chain-of-thought ecociweyo kwiimodeli ze-DeepSeek inikezela ngeepateni zokucinga ezibonakalayo ngexesha lokukhupha izigqibo.

Iseti yedatha ye-DeepSeek Imikhondo ye-CoT Ukubonakala

Intloko yoKucinga

Imodyuli yokucinga ezinikeleyo ikhupha amanyathelo okucinga acacileyo phambi kwempendulo yokugqibela, ivumela ukutolikwa okupheleleyo.

Intloko yokucinga Etolikwayo Ehlolwayo
️

Ukubonakala Okupheleleyo

Inyathelo ngalinye lokucinga libonisiwe—bona kanye indlela i-AI engacwangciswanga efikelela ngayo kwimveliso yayo.

Ukuqonda koPhando

Qonda iipateni zokuqonda emva kokuziphatha kwe-AI engacwangciswanga.

️

Uyilo lwe-Guardrail

Sebenzisa ingqiqo ebonakalayo ukuyila ii-guardrails ze-AI ezingcono kwinqanaba lokucinga.

Bona Indlela i-AI Ecabanga Ngayo Ngokwenene

Shannon V1.5 Ukucinga kutyhila ingqiqo emva kwemveliso engacwangciswanga—kubalulekile kukhuseleko lwe-AI yesizukulwana esilandelayo.

Zonke iilinki zophando