Asigcini okanye sifikelele kwidatha yomsebenzisi, kwaye asinqumami iakhawunti ngaphandle kokuba igunya elisemthethweni lifune inyathelo lokunyanzelisa.
I-NVFP4 Elinganiswe Ngobungakanani - I-AI yeShishini Engabizi Kakhulu

ShannonOlula 1.6

I-AI yeshishini engabizi kakhulu exhaswa yiMistral Large 3ngeiiparamitha ezingama-675B zizonkekunyeiiparamitha ezisebenzayo ezingama-41Bngobume obucokisekileyo be-Mixture-of-Experts. Iqeqeshwe emva koko kwiiimveliso ezingama-2,500 ze-Claude Opus 4.5zokulandela imiyalelo okungaqhelekanga.Ubungakanani be-NVFP4yenza kube lula ukusasazwa kwenodi enye kwiH100s okanye A100s.

675B
Iiparamitha Zizonke
41B
Iiparamitha Ezisebenzayo
NVFP4
Ubungakanani
256K
Umxholo
2.5B
Isixhobo Sokufaka Umbono
Uhlelo lwe-Lite
Shannon Lite 1.6
v1.6.0-lite-nvfp4
Iinkcukacha zobuGcisa:
Imodeli Yesiseko Mistral Large 3
Ulwakhiwo I-MoE Ecokisekileyo
Iiparamitha Zizonke 675B
Iiparamitha Ezisebenzayo 41B
Ubungakanani NVFP4
Uqeqesho Lwasemva Claude Opus 4.5
Iisampulu Zoqeqesho 2,500

Mistral Large 3: I-Mixture-of-Experts Ecokisekileyo

I-Shannon Lite 1.6 yakhiwe kwi-Mistral Large 3, imodeli ye-Mixture-of-Experts ecokisekileyo, enezindlela ezininzi, ekumgangatho ophezulu eyilwe ukusuka phantsi ukuze ithembeke, iqonde umxholo omde, kwaye isebenze kakuhle kumgangatho wemveliso. Inguqulelo eqeqeshwe emva koko yenzelwe incoko, iiarhente, kunye neenkqubo ezisekelwe kwimiyalelo.

673B

Imodeli yoLwimi

Ulwakhiwo lwe-MoE Ecokisekileyo ngeeparamitha ezisebenzayo ezingama-39B ngokudlula phambili nganye

2.5B

Isixhobo Sokufaka Umbono

Isixhobo sokufaka esidityanisiweyo, esinezindlela ezininzi sokuhlalutya imifanekiso nokuqonda okubonwayo

256K

Ifestile Yomxholo

Umxholo Olwandisiweyo wokuqonda amaxwebhu ngokupheleleyo kunye ne-RAG

12+

Iilwimi

IsiNgesi, IsiFrentshi, IsiSpanish, IsiJamani, IsiTshayina, IsiJapan, IsiKoriya, IsiArabhu, nezinye ezininzi

Ukusasazwa Kwezoshishino Okungabizi Kakhulu

I-Shannon Lite 1.6 isebenzisa iteknoloji yobungakanani be-NVFP4 (i-4-bit floating point) ye-NVIDIA ukunciphisa kakhulu iimfuno zememori ngelixa igcina umgangatho wemodeli. Sasaza i-AI ekumgangatho ophezulu kwiziseko ze-GPU ezifikelelekayo ngaphandle kobunzima be-multi-node.

💰

Iindleko Ezincitshisiweyo Zeziseko

Ubungakanani be-NVFP4 bunciphisa indawo yememori malunga ne-4x xa kuthelekiswa ne-BF16, okwenza kube lula ukusasazwa kwii-GPU ezimbalwa kwaye kunciphise kakhulu i-TCO ye-AI yeshishini.

Ukusasazwa Kwenodi Enye

Sasaza imodeli epheleleyo yeeparamitha ezingama-675B kwinodi enye yee-H100s okanye A100s. Akukho kulungelelaniswa kwe-multi-node okunzima, kuncitshiswe iindleko zenethiwekhi, kwenziwe lula imisebenzi.

Umgangatho Wemodeli Ogciniweyo

Iindlela zobungakanani obuphezulu zigcina ukusebenza kwemodeli kulo lonke ulwazi, ukulandela imiyalelo, kunye nemisebenzi enezindlela ezininzi ngokuncipha komgangatho okuncinci.

Ukucocwa Kolwazi lwe-Claude Opus 4.5

I-Shannon Lite 1.6 iqeqeshwe emva koko ngononophelo olukhulu kusetyenziswa iimveliso ezingama-2,500 ezikhethwe ngononophelo kwiClaude Opus 4.5, imodeli ye-Anthropic ekwaziyo kakhulu. Le ndlela yokucocwa kolwazi ibamba iipatheni zokucinga eziphambili, ukutolikwa kwemiyalelo ecokisekileyo, kunye nomgangatho ophezulu wempendulo.

Isiseko se-Mistral Large 3 Instruct 2512

Yakhiwe kwimodeli ye-Instruct ye-Mistral ekumgangatho ophezulu (inguqulelo 2512) ngokuchaneka kwe-BF16. Esi siseko sibonelela ngezakhono ezikumgangatho ophezulu ezenzelwe abancedisi abakumgangatho wemveliso, iinkqubo ezixhaswe kukufumana ulwazi, imisebenzi yesayensi, kunye neenkqubo zeshishini ezinzima.

Isiseko se-BF16 Ilungiswe Ngemiyalelo Ilungele Imveliso Ilayisenisi ye-Apache 2.0

Ukucocwa Kwemveliso ye-Claude Opus 4.5

Iqeqeshwe emva koko kwiimveliso ezingama-2,500 ezikumgangatho ophezulu ezivela kwi-Claude Opus 4.5, ibamba ezona zakhono zokucinga ziphambili ze-Anthropic. Idatha ekhethiweyo igxile ekulandeleni imiyalelo enzima, ukuqonda okucokisekileyo, kunye nokuvelisa iimpendulo ezikumgangatho ophezulu kwiindawo ezahlukeneyo.

Iisampulu ezingama-2,500 Idatha Ekhethiweyo Ukugxila Kumgangatho Iindawo Ezahlukeneyo

Inkqubo ye-NVFP4 yoKulinganisa

Ukwenziwa kwe-NVIDIA FP4 yokulinganisa okusezingeni eliphezulu kusetyenziswe emva koqeqesho ukunciphisa indawo yememori ngelixa kugcinwa umgangatho wemodeli. Kulinganiswe ngokukodwa kubunzima obuqeqeshwe emva kokuze kugcinwe ukudluliselwa kolwazi lwe-Claude Opus 4.5 kunye nezakhono zokulandela imiyalelo.

NVFP4 Ukuchaneka kwe-4-bit Kulinganiswe Umgangatho Ugciniwe

Uvavanyo noQinisekiso

Uvavanyo olubanzi kwiibhenchmark zokulandela imiyalelo, imisebenzi yokuqiqa, kunye neemeko zeshishini zangempela. Kuqinisekiswe ukuziphatha okungaguqukiyo phakathi kweendawo ezahlukeneyo, iziphumo ezizinzileyo, kunye nokusebenza okuthembekileyo kwiindawo zemveliso.

Kubhenchmarkiwe Phakathi Kweendawo Ezahlukeneyo Kuqinisekiswe Imveliso Iziphumo Ezizinzileyo

Iinketho zoKusasazwa kwe-GPU eziGuquguqukayo

I-Shannon Lite 1.6 enokulinganisa kwe-NVFP4 yenza ukusasazwa okungabizi kakhulu kwiinkqubo ze-NVIDIA GPU ezisemgangathweni weshishini, isenza i-AI yomda ifikeleleke kusasazo lweshishini ngaphandle kokufuna iiklasi ezibiza kakhulu ezineendawo ezininzi.

NVIDIA H100 SXM

Ukusebenza okugqwesileyo ngoyilo lwe-Hopper kunye nememori ye-HBM3

Indawo enye (8x H100)
Ukuchaneka kwe-NVFP4
80GB HBM3 nge-GPU nganye
Ukuphuma Okukhulu

NVIDIA A100 SXM

Ukuthembeka okungqinwe kuyilo lwe-Ampere lwe-GPU

Indawo enye (8x A100)
Ukuchaneka kwe-NVFP4
80GB HBM2e nge-GPU nganye
Engabizi Kakhulu

Shannon Cloud

Ukusasazwa okulawulwa ngokupheleleyo ngaphandle kweziseko

Ukufikelela Ngokukhawuleza
Ukuzenzekelayo koKulinganisa
Ilungele i-REST API
99.9% SLA

Iimpawu ze-AI ezilungele iShishini

I-Shannon Lite 1.6 inikezela ngezakhono zomda ezifunyenwe kwi-Mistral Large 3 kwaye zaphuculwa nge-Claude Opus 4.5 emva koqeqesho, zilungelelaniswe nemisebenzi yemveliso kwiimeko zeshishini ezahlukeneyo.

Umbono Weendlela Ezininzi

I-encoder yombono ye-2.5B parameter edibeneyo yenza uhlalutyo lwemifanekiso, ukuphendula imibuzo ebonakalayo, kunye nokuqonda amaxwebhu ngemifanekiso.

Ubugqwesha Beelwimi Ezininzi

Inkxaso yemveli yeelwimi ezingaphezu kwe-12 kubandakanya isiNgesi, isiFrentshi, iSpanish, isiJamani, isiTaliyane, isiPhuthukezi, isiDatshi, isiTshayina, isiJapani, isiKorea, nesiArabhu.

🤖

Izakhono Zobunxibelelanisi

Iimpawu zobunxibelelanisi ezibalaseleyo kunye nokubiza umsebenzi wemveli kunye neziphumo ze-JSON ezicwangcisiweyo zokusetyenziswa kwezixhobo ezizimeleyo kunye nokuzenzekelayo kokuhamba komsebenzi.

Ukuthobela iSistim yePrompt

Ukuthobela okunamandla kunye nenkxaso yeeprompt zesistim, okwenza ulawulo oluchanekileyo lokuziphatha kunye nokugcinwa kwesimilo esingaguqukiyo.

256K Umxholo Olude

Ifestile yomxholo eyandisiweyo yokuqonda amaxwebhu ngokubanzi, iingxoxo ezandisiweyo, kunye nokuveliswa okwandisiweyo kokufunyanwa (RAG).

🔧

Ukubiza Umsebenzi Wemveli

Inkxaso yokubiza umsebenzi eyakhelwe ngaphakathi kunye neziphumo ze-JSON ezithembekileyo zokudibanisa okungenamthungo nezixhobo zangaphandle, ii-API, kunye neenkonzo.

Ilungelelaniswe Kwimisebenzi yeMveliso

Ngokusebenza okunamandla komxholo olude, ukuziphatha okuzinzileyo nokungaguqukiyo phakathi kweendawo ezahlukeneyo, i-Shannon Lite 1.6 igqwesile kwiimeko zeshishini nezophando ezahlukeneyo.

📄

Ukuqonda Amaxwebhu Amade

Qhuba kwaye uhlalutye amaxwebhu abanzi, izivumelwano, iingxelo, kunye namaphepha ophando ngefestile yomxholo ye-256K

🤖

Abancedisi be-AI beMveliso

Nika amandla abancedisi be-AI abasetyenziswa mihla le ngeempendulo ezithembekileyo, ezingaguqukiyo kunye nokulandela imiyalelo okunamandla

🔧

Ukuhamba Komsebenzi Wobunxibelelanisi

Ukusetyenziswa kwezixhobo ezikumgangatho ophezulu kunye nokubiza umsebenzi wokuphunyezwa komsebenzi ozimeleyo kunye nokuzenzekelayo kokuhamba komsebenzi

🏢

Umsebenzi Wolwazi weShishini

Ukuhamba komsebenzi weshishini oluntsonkothileyo olufuna izakhono ze-AI zomda ngeziphumo ezingaguqukiyo, ezithembekileyo

💻

Umncedisi Wokukhowuda Ngokubanzi

Ukuveliswa kwekhowudi, ukulungisa iimpazamo, amaxwebhu, kunye noncedo lophuhliso lwesoftware kwiilwimi ezininzi

Uphando Lwezenzululwazi

Uncedo lophando, uphononongo lweencwadi, ukuqhuba umsebenzi wesayensi, kunye nokuveliswa kweengcinga

Ukuveliswa Okwandisiweyo Kokufunyanwa

Ukusebenza okugqwesileyo kwiinkqubo ze-RAG ngokudibanisa umxholo okuthembekileyo kunye nokudibanisa okuchanekileyo kokufunyanwa

🌍

Izicelo Zeelwimi Ezininzi

Izicelo zeshishini zehlabathi ezifuna umgangatho ongaguqukiyo kwiilwimi ezingaphezu kwe-12 ezixhaswayo

Shannon Lite vs Shannon Pro

Khetha imodeli ye-Shannon efanelekileyo kwiimfuno zakho. I-Shannon Lite inikezela ngosasazo lweshishini olungabizi kakhulu, ngelixa i-Shannon Pro inikezela ngesakhono esiphezulu ngokuqiqa okusezingeni eliphezulu kwe-chain-of-thought kunye nenkxaso yeZakhono.

Impawu Shannon Lite 1.6 Shannon Pro 1.6
Imodeli Yesiseko Mistral Large 3 (675B) Mistral Large 3 (675B)
Iiparameter Ezisebenzayo 41B (Granular MoE) 41B (Granular MoE)
Ukuchaneka NVFP4 (4-bit) BF16 epheleleyo (16-bit)
Idatha yasemva koQeqesho Iziphumo ze-2,500 zeClaude Opus 4.5 Iingoma zokuCinga zeKIMI K2
Indlela yasemva koQeqesho Ukulungiswa okuLungileyo okuLawulwayo GRPO (UkuLungiselela uMgaqo-nkqubo oHlobene neQela)
Indlela yokuCinga Umgangatho Iingoma zeKhonkco lokuCinga
Inkxaso yeZakhono - YeyeePro kuphelaIzakhono zoMthonyama
Ukusasazwa H100/A100 (Inqanaba eliNye) B200/H200 (FP8)
Eyona ilungileyo ku I-AI yeShishini eNgenzi mali Ubukhulu beSakhono + UkuCinga

Ufuna ukuCinga okuPhambili neZakhono?

I-Shannon Pro 1.6 ibonisa iingoma zokuCinga zeKIMI K2 ngoqeqesho lweGRPO lokucacisa ukucinga kwekhonkco, kunye nenkxaso yeZakhono zoMthonyama kwiinkqubo zomsebenzi ze-AI ezenziwe ngokwezifiso.

Hlola iShannon Pro

Fumana iShannon Lite 1.6

Izakhono ze-AI eziphambili kunye ne-NVFP4 quantization engabizi mali. Sasaza kwi-H100 okanye kwi-A100 infrastructure ukuze ufumane intsebenzo yebakala leshishini ngexabiso elifikelelekayo.

Zonke iilinki zophando