Ubusazi na ukuba iikhompyutha zinokuvelisa iitekisi eziphantse zifane nezo zinokubhalwa ngabantu?
Enkosi kwinkqubela phambili ye-AI sibona amaza kwiimodeli zolwimi olukhulu.
Ngoku, basebenza ngomkhamo ongazange ubonwe ngaphambili!
Sinokusebenzisa le mizekelo kwiimeko ezahlukeneyo ezinomdla. Kweli nqaku, siza kujonga ezinye zezicelo ezinomdla zemifuziselo yolwimi olukhulu.
Sithetha Ntoni NgeeModeli zoLwimi olukhulu?
Imifuziselo yeelwimi ezinkulu ziimodeli ze-AI eziphuhliselwe ukutolika nokudala ulwimi lwabantu. Ezi modeli zisebenzisa iindlela ezikumgangatho ophezulu zokufunda ngoomatshini.
Ngokomzekelo, basebenzisa ukufunda okunzulu ukuphonononga umthamo omkhulu wedatha yokubhaliweyo. Kwaye, bayaziqonda iipateni zolwimi lwendalo kunye nezakhiwo.
Iimodeli ziqeqeshwe kwiiseti zedatha ezinkulu ezifana neencwadi, amaphepha, kunye namaphepha ewebhu. Ngale ndlela, bayakwazi ukubuqonda ubucukubhede bolwimi lomntu. Ngoko ke, banokwenza umxholo ongaqondakaliyo kwizinto ezibhalwe ngabantu.
Yeyiphi eminye imizekelo yale mifuziselo yolwimi?
- I-GPT-3:Lo ngumzekelo weelwimi osikiweyo owenziwe yi-OpenAI ekwaziyo ukuvelisa umbhalo, ukuphendula imibuzo, kunye neentlobo ngeentlobo zeminye imisebenzi ye-NLP.
- BHALA: Lo ngumzekelo onamandla wolwimi owenziwe ngu Uphando enokusetyenziselwa eminye imisebenzi, njengokuphendula imibuzo kunye nokuguqulela ulwimi.
- XLNet: Le modeli yolwimi iphucukileyo yenziwa nguGoogle kunye neYunivesithi yaseCarnegie Mellon kwaye isebenzisa indlela entsha yoqeqesho ukukhulisa ukuqonda kunye nokuveliswa kolwimi lokwenyani.
- ROBERTa: Le modeli yolwimi yenziwe nguFacebook kwaye isekwe kuyilo lweBERT. Ifikelele kwintsebenzo ekumgangatho ophezulu kwiintlobo ngeentlobo zezicelo ezibandakanya ukusetyenzwa kolwimi lwendalo.
- T5: i-text-to-text transfer transformer yenziwe ngu Uphando kwaye isenokuthi ilungiselelwe iinjongo ezahlukeneyo ezibandakanya ukusetyenzwa kolwimi lwendalo.
- GShard: UGoogle wenze isakhelo soqeqesho esisasazwayo esinokusetyenziswa ukuqeqesha imifuziselo yolwimi olukhulu.
- IMegatron: Ii-NVIDIA's inkqubo yoqeqesho yolwimi olusebenzayo oluphezulu, enokuqeqesha imifuziselo ukuya kuthi ga kwi-8.3 yeebhiliyoni zeeparamitha.
- Albert uvavanyo lokuhambelana kwegama kunye negama: Yinguqulelo “elula” esebenza ngakumbi nenokukaleka ye-BERT eyenziwe nguGoogle kunye neToyota Technological Institute eChicago.
- E-Elektroniki: I-Google kunye neYunivesithi yaseStanford zenze imodeli yolwimi esebenzisa isicwangciso esitsha sokuqeqeshwa kwangaphambili esibizwa ngokuba "ucalucalulo lwangaphambili loqeqesho" ukunyusa ukusebenza kwayo kwimisebenzi ephantsi.
- Umguquleli: Yimodeli yolwimi lukaGoogle olusebenzisa indlela esebenza ngakumbi yokuthathela ingqalelo ukwenza uqeqesho lweemodeli ezinkulu kunye nokuthelekelela ngokukhawuleza.
Ke, zeziphi iimeko zokusetyenziswa kwezi modeli zolwimi zinkulu?
Iimeko eziBalulekileyo zokuSetyenziswa kweeModeli zoLwimi olukhulu
Uhlalutyo lweemvakalelo
Le mizekelo inokuvavanya isicatshulwa kwaye ithathe isigqibo sokuba imvakalelo ilungile, ingalunganga, okanye ingathathi hlangothi. Ubukhulu becala, basebenzisa ulwimi lwendalo kwaye yokufunda umatshini iindlela zokwenza oku.
Ngenxa yomthamo wabo wokuqonda umxholo kunye nentsingiselo yamagama kwibinzana, iimodeli ezifana neBERT kunye neRoBERTa zisetyenziselwa Uhlalutyo lweemvakalelo.
Uhlalutyo lweemvakalelo luya luchaneka ngakumbi kwaye lusebenza kakuhle kwiimodeli zolwimi. Singasebenzisa uhlalutyo lweemvakalelo kuluhlu olubanzi lwamacandelo afana nokuthengisa, inkonzo yabathengi, kunye nokunye.
Ii-Chatbots kunye neearhente zokuncokola
Iiarhente zonxibelelwano kunye nee-chatbots ziya zithandwa kuluhlu olubanzi lwezicelo. Sifumana ukuzisebenzisa kwinkonzo yabathengi kunye neentengiso kunye nemfundo kunye nokhathalelo lwempilo. Iimodeli zeelwimi ezinkulu zisembindini wezi nkqubo.
Banokutolika kwaye baphendule kwigalelo lomntu ngolwimi lwendalo. Iimodeli ezifana ne-GPT-3 kunye ne-BERT zihlala ziqeshwa kwii-chatbots ukwenza iimpendulo ezibandakanya ngakumbi.
Ezi modeli ziqeqeshwe kwimithamo emikhulu yedatha yokubhaliweyo. Banokuqonda kwaye balinganise iipateni nezakhiwo zolwimi lwabantu. IiChatbots zinokuphucula kakhulu ukubandakanyeka kwabathengi.
Ukuguqulelwa kolwimi
Sinokuguqulela umbhalo ukusuka kolunye ulwimi ukuya kolunye ngokuchaneka okungaqhelekanga enkosi kwiimodeli zolwimi ezinkulu. Le mizekelo iqonda ukuntsonkotha kweelwimi ezininzi. Kwaye, bayazalana ngokuqeqeshwa kwimithamo emikhulu yedatha yokubhaliweyo ngeelwimi ezininzi.
Iimodeli zoguqulo lweelwimi ezidumileyo ziquka i-OpenAI's GPT-3, i-Facebook's M2M-100, kunye ne-Google's Neural Machine Translation (NMT). Ngenxa yeenguqu eziziswe zezi modeli, kulula kakhulu ukunxibelelana nabantu kwihlabathi liphela.
Isishwankathelo sombhalo
Ushwankathelo lwesicatshulwa yinkqubo yokunciphisa isicatshulwa eside sibe sisishwankathelo ngelixa ugcina amanqaku aphambili. Iimodeli zolwimi ezinkulu unokuphonononga, aze aqonde isakhiwo sesicatshulwa. Oku kubenza bakwazi ukunika ushwankathelo oluchanekileyo, nto leyo ebenza babe luncedo kakhulu kulo mmandla.
Kwimisebenzi yesishwankathelo sombhalo, iimodeli ezifana ne-BERT kunye ne-GPT-3, zisetyenzisiwe. Zibonisa impumelelo ebalaseleyo ekuveliseni izishwankathelo eziquka iingcamango eziphambili zoxwebhu.
Sinokukhupha ulwazi kwisicatshulwa eside esinemisebenzi ebalulekileyo kumajelo eendaba, umthetho, kunye nemfundo.
Ukuphendula umbuzo
Ukubonelela ngomatshini ngombuzo kwaye ulindele ukuba uze nempendulo efanelekileyo kwaziwa njengempendulo yombuzo ekuqhubeni ulwimi lwendalo. Iimodeli zeelwimi ezinkulu ezifana ne-GPT-3 kunye ne-BERT zenziwe ngale njongo engqondweni.
Le mizekelo ihlola umbuzo wegalelo kwaye ikhethe olona lwazi lufanelekileyo kwidatha.
Ezi modeli zivavanya umbuzo wegalelo kwaye zikhethe eyona datha ifanelekileyo kwizixa ezikhulu zolwazi. Oku kunokwenzeka ngokusebenzisa ubugocigoci amanethiwekhi.
Ngamandla ale modeli, sinokuphuhlisa iinkqubo zokufumana izisombululo kwimiba entsonkothileyo. Oku kuya kuphucula amandla ethu okufunda nokwenza izigqibo.
Ukudalwa komxholo kunye nokuveliswa kwesicatshulwa
Iimodeli zeelwimi ezinkulu zivelisa umgangatho ophezulu, umxholo obandakanya amacandelo ahlukeneyo. Ezi modeli zinokuqamba amanqaku, izithuba zemidiya yoluntu, iinkcazo zemveliso, kunye nokunye. Umzekelo, i-GPT-3 imodeli ethandwayo kule meko.
Idala umxholo ekunzima ukwahlula kwisicatshulwa esibhalwe ngabantu. Ngokusebenzisa le mizekelo, iinkampani zinokugcina ixesha kunye neendleko. Banokunxibelelana nabaphulaphuli babo lula kakhulu.
Ukuqondwa kwentetho kunye noshicilelo lwentetho ukuya kumbhalo
Ukuqondwa kwentetho kunye noshicilelo lwentetho-ukuya-kumbhalo zombini zisebenzisa imodeli yolwimi olukhulu.
Le mizekelo, ngokukodwa, iqeqeshelwa idatha yomsindo. Kwaye, basebenza phambili umatshini wokufunda iialgorithms ukukhuphela ngokuchanekileyo amagama athethiweyo kwisicatshulwa. I-Wav2vec, ephuhliswe nguFacebook AI, ngumzekelo omnye wemodeli yolwimi esetyenziselwa ukuqondwa kwentetho.
Le modeli iqeqeshelwe ukuqaphela kunye nokukhupha iimpawu ezifanelekileyo kumagalelo omsindo. Ingasetyenziselwa ukuqondwa kwentetho okanye eminye imisebenzi yokulungisa ulwimi lwendalo.
Iinkampani zinokunyusa umgangatho kunye nesantya seenkonzo zazo zokukhutshelwa ngelixa zisehlisa iindleko kunye nokukhulisa ukusebenza kakuhle ngokwamkela iimodeli ezinkulu zeelwimi.
Ukusonga, Lijongeka Njani Ikamva?
Iimodeli zeelwimi ezinkulu ziya kudlala indima ebalulekileyo kumashishini ahlukeneyo. Abaphandi kunye nabaphuhlisi bazama ukuphucula le mizekelo ukuze ibe namandla ngakumbi.
Sinokuba nokuqonda okuphuculweyo komxholo kunye nokusebenza okuphuculweyo kunye nokuchaneka. Kwakhona, sinokuzuza kumava asebenzisekayo ngakumbi nangenamthungo kumaqonga ahlukeneyo.
Banokutshintsha indlela esinxibelelana ngayo kunye nokuzibandakanya kwitekhnoloji.
Shiya iMpendulo