Natural Language Processing (NLP) has changed the way we interact with machines. Our applications and software can now process and understand human language.
As a branch of artificial intelligence, NLP focuses on natural-language interaction between computers and people.
It helps machines analyze, understand, and generate human language, which opens up many applications such as speech recognition, machine translation, sentiment analysis, and chatbots.
The field has advanced considerably in recent years, allowing machines not only to understand language but also to use it fluently and appropriately.
In this article, we look at several NLP language models. So follow along, and let's learn about these models!
1. BERT
BERT (Bidirectional Encoder Representations from Transformers) is a state-of-the-art language model for Natural Language Processing (NLP). It was developed by Google in 2018 and is based on the Transformer architecture, a neural network designed to process sequential input.
BERT is a pre-trained language model, meaning it has been trained on large volumes of text data to learn the patterns and structure of natural language.
BERT is bidirectional, meaning it captures the context and meaning of a word from both the text that precedes it and the text that follows it, which makes it highly effective at understanding the meaning of complex sentences.
How does it work?
BERT is pre-trained with self-supervised learning on large amounts of unlabeled text. During pre-training it learns to predict words that have been masked out of a sentence (masked language modeling) and to judge whether one sentence follows another (next sentence prediction).
Through this training, BERT can produce high-quality embeddings that can be used for a variety of NLP tasks, including sentiment analysis, text classification, question answering, and more.
In addition, BERT can be fine-tuned for a specific task using a smaller, task-specific dataset.
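To make the masked-word objective concrete, here is a minimal sketch in plain Python of how training examples for masked language modeling can be built. It assumes whitespace tokens and a per-token coin flip, which is a simplification; the function name `mask_tokens` and its parameters are hypothetical and not part of any library. The 15% selection rate and the 80/10/10 split between `[MASK]`, a random token, and the unchanged token follow the BERT paper.

```python
import random

MASK = "[MASK]"

def mask_tokens(tokens, vocab, mask_prob=0.15, seed=None):
    """Return (corrupted_tokens, labels) for masked language modeling.

    labels[i] holds the original token where the model must make a
    prediction, and None everywhere else.
    """
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:
            labels.append(tok)  # the model must recover this token
            roll = rng.random()
            if roll < 0.8:
                corrupted.append(MASK)               # 80%: replace with [MASK]
            elif roll < 0.9:
                corrupted.append(rng.choice(vocab))  # 10%: random token
            else:
                corrupted.append(tok)                # 10%: leave unchanged
        else:
            corrupted.append(tok)
            labels.append(None)  # no prediction needed at this position
    return corrupted, labels

tokens = "the cat sat on the mat".split()
corrupted, labels = mask_tokens(tokens, vocab=sorted(set(tokens)), seed=0)
print(corrupted)
print(labels)
```

A real pipeline would work on subword IDs and feed the corrupted sequence to the network, computing the loss only at positions where the label is set.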
Where is BERT used?
BERT is used in a wide range of well-known NLP applications. Google, for example, has used it to improve the accuracy of its search engine results, while Facebook has used it to improve its recommendation algorithms.
BERT has also been applied to sentiment analysis for chatbots, machine translation, and natural language understanding.
In addition, BERT has been used in many academic research papers to improve the performance of NLP models on a variety of tasks. Overall, BERT has become an essential tool for NLP researchers and practitioners, and its influence on the field is expected to keep growing.
2. RoBERTa
RoBERTa (Robustly Optimized BERT Pretraining Approach) is a natural language processing language model released by Facebook AI in 2019. It is an improved version of BERT intended to overcome some of the limitations of the original BERT model.
RoBERTa was trained in a way similar to BERT, except that it uses more training data and refines the training procedure to achieve higher performance.
Like BERT, RoBERTa is a pre-trained language model that can be fine-tuned to achieve high accuracy on a specific task.
How does it work?
RoBERTa uses self-supervised learning to train on a large amount of text data. During training it learns to predict words that have been masked out of sentences; unlike BERT, it drops the next sentence prediction objective.
RoBERTa also uses several refined training techniques, such as dynamic masking, which changes the masked positions each time a sequence is seen, to improve how well the model generalizes to new data.
In addition, to increase its accuracy, RoBERTa uses a large amount of data from several sources, including Wikipedia, Common Crawl-derived corpora, and BookCorpus.
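The difference between static and dynamic masking can be sketched in a few lines of plain Python. This is an illustration under simplifying assumptions (whitespace-level tokens, every selected token replaced by `<mask>`); the helper names `random_mask`, `static_epochs`, and `dynamic_epochs` are hypothetical.

```python
import random

MASK = "<mask>"

def random_mask(tokens, rng, mask_prob=0.15):
    """Replace each token with MASK independently with probability mask_prob."""
    return [MASK if rng.random() < mask_prob else tok for tok in tokens]

def static_epochs(tokens, num_epochs, seed=0):
    """Static masking: mask once during preprocessing, reuse it every epoch."""
    masked = random_mask(tokens, random.Random(seed))
    return [masked for _ in range(num_epochs)]

def dynamic_epochs(tokens, num_epochs, seed=0):
    """Dynamic masking: draw a fresh mask each time the sequence is served."""
    rng = random.Random(seed)
    return [random_mask(tokens, rng) for _ in range(num_epochs)]

sentence = "language models learn patterns from very large text corpora".split()
for name, fn in (("static", static_epochs), ("dynamic", dynamic_epochs)):
    print(name)
    for epoch in fn(sentence, num_epochs=3):
        print("  ", " ".join(epoch))
```

With dynamic masking the model sees different prediction targets for the same sentence across epochs, which matters more as training runs get longer.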
Where can we use RoBERTa?
RoBERTa is commonly used for sentiment analysis, text classification, named entity recognition, machine translation, and question answering.
It can be used to extract relevant information from unstructured text data such as social media posts, customer reviews, news articles, and other sources.
Beyond these common NLP tasks, RoBERTa has been used in more specialized applications such as document summarization, text generation, and speech recognition. It has also been used to improve the accuracy of chatbots, virtual assistants, and other conversational AI systems.
3. OpenAI's GPT-3
GPT-3 (Generative Pre-trained Transformer 3) is a language model from OpenAI that produces human-like writing using deep learning techniques. With 175 billion parameters, GPT-3 was one of the largest language models ever built at the time of its release.
The model was trained on a wide range of text data, including books, articles, and web pages, and it can generate content on a variety of topics.
How does it work?
GPT-3 generates text using an unsupervised learning approach. This means the model is not explicitly taught to perform any particular task; instead, it learns to produce text by picking up patterns in large volumes of text data, predicting each next token from the tokens before it.
By training it further on smaller, task-specific datasets, the model can then be fine-tuned for specific tasks such as text completion or sentiment analysis.
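The idea of learning from raw text and then generating one token at a time can be illustrated with a toy model. The sketch below swaps the Transformer for a simple bigram count table, which is an assumption made purely for illustration and is nothing like GPT-3 in scale or quality; `train_bigram` and `generate` are hypothetical names.

```python
import random
from collections import Counter, defaultdict

def train_bigram(corpus):
    """Count how often each token follows another in unlabeled text."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts

def generate(counts, max_len=20, seed=None):
    """Sample text one token at a time, each conditioned on the previous one."""
    rng = random.Random(seed)
    out, prev = [], "<s>"
    for _ in range(max_len):
        options = counts[prev]
        nxt = rng.choices(list(options), weights=list(options.values()))[0]
        if nxt == "</s>":  # end-of-sentence marker stops generation
            break
        out.append(nxt)
        prev = nxt
    return " ".join(out)

corpus = [
    "the model reads text",
    "the model writes text",
    "the reader reads books",
]
model = train_bigram(corpus)
print(generate(model, seed=1))
```

GPT-3 follows the same generate-append-repeat loop, but it conditions on the whole preceding context rather than only the previous token.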
Areas of application
GPT-3 has several applications in natural language processing. Text completion, language translation, sentiment analysis, and other applications are possible with the model. GPT-3 has also been used to write poetry, news stories, and computer code.
One promising application of GPT-3 is building chatbots and virtual assistants. Because the model can generate human-like text, it is well suited to conversational applications.
GPT-3 has also been used to produce tailored content for websites and social media platforms, and to assist with data analysis and research.
4. GPT-4
GPT-4 is, at the time of writing, the latest and most advanced language model in OpenAI's GPT series. OpenAI has not disclosed its parameter count, so widely repeated figures such as 10 trillion parameters are unconfirmed; the model nevertheless surpasses its predecessor, GPT-3, and is among the most capable AI models available.
How does it work?
GPT-4 generates natural-language text using advanced deep learning techniques. It is trained on a large text dataset that includes books, journals, and web pages, which allows it to generate content on a wide variety of topics.
In addition, by training it on smaller, task-specific datasets, GPT-4 can be fine-tuned to perform specific tasks such as question answering or summarization.
Areas of application
Because of its large scale and strong capabilities, GPT-4 supports a wide variety of applications.
One of its most promising uses is in natural language processing, where it can be used to improve chatbots, virtual assistants, and language translation systems that produce natural-language responses nearly indistinguishable from those written by people.
GPT-4 can also be used in education.
The model can be used to build intelligent tutoring systems that adapt to a student's learning style and provide individualized feedback and help. This could improve the quality of education and make learning accessible to everyone.
5. XLNet
XLNet is a language model developed in 2019 by researchers at Carnegie Mellon University and Google AI. Its design is based on the Transformer architecture, which is also used in BERT and other language models.
XLNet, however, introduces a permutation-based pre-training strategy that allowed it to outperform other models on a range of natural language processing tasks.
How does it work?
XLNet is built on autoregressive language modeling, which involves predicting the next word in a text sequence based on the words that precede it.
Unlike language models that read strictly left-to-right or right-to-left, however, XLNet obtains bidirectional context by training over permutations of the order in which the words of a sentence are predicted. This allows it to capture long-range relationships between words and make more accurate predictions.
In addition to its permutation-based pre-training strategy, XLNet incorporates techniques from Transformer-XL such as relative positional encoding and a segment-level recurrence mechanism.
These techniques contribute to the model's overall performance and enable it to handle a range of natural language processing tasks, such as language translation, sentiment analysis, and named entity recognition.
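A small sketch can show what a permuted prediction order means in practice. For one sampled order, the code below lists which tokens each position is allowed to condition on. The function name `permutation_contexts` is hypothetical, and the sketch only enumerates visible tokens; the actual model keeps the input in its natural order and enforces the permutation through attention masks.

```python
import random

def permutation_contexts(tokens, seed=None):
    """Sample one factorization order and list each position's visible context.

    Returns (order, contexts), where order is the sequence of positions in
    the order they are predicted, and contexts[pos] holds the tokens that
    were predicted earlier and are therefore visible when predicting pos.
    """
    rng = random.Random(seed)
    order = list(range(len(tokens)))
    rng.shuffle(order)
    contexts = {}
    for step, pos in enumerate(order):
        visible = sorted(order[:step])  # positions predicted before this one
        contexts[pos] = [tokens[i] for i in visible]
    return order, contexts

tokens = "new york is a city".split()
order, contexts = permutation_contexts(tokens, seed=0)
for pos in order:
    print(f"predict {tokens[pos]!r} given {contexts[pos]}")
```

Because a different order is sampled for each training sequence, every token is eventually predicted from contexts on both its left and its right, which is how an autoregressive objective ends up with bidirectional context.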
Use cases for XLNet
XLNet's advanced features and adaptability make it an effective tool for a wide range of natural language processing applications, including chatbots and virtual assistants, language translation, and sentiment analysis.
Its continued development and integration with software and applications will likely lead to even more compelling use cases in the future.
6. ELECTRA
ELECTRA is a state-of-the-art natural language processing model created by Google researchers. Its name stands for “Efficiently Learning an Encoder that Classifies Token Replacements Accurately”, and it is known for its strong accuracy and training efficiency.
How does it work?
ELECTRA works by replacing a portion of the tokens in a text sequence with plausible alternatives produced by a small generator network. The model's objective is to predict correctly whether each token in the corrupted input is original or a replacement. As a result, ELECTRA learns the contextual relationships between words in a text sequence more efficiently.
Moreover, because ELECTRA classifies every input token rather than predicting only the small masked subset, it receives a training signal from the whole sequence and can reach comparable quality with less compute than masked language models.
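The replaced-token-detection objective can be sketched as a data-preparation step. In the plain-Python example below, the generator is stood in for by uniform random sampling from a vocabulary, which is an assumption for illustration only (ELECTRA uses a small masked language model as its generator); the name `corrupt` is hypothetical.

```python
import random

def corrupt(tokens, vocab, replace_prob=0.15, seed=None):
    """Build a replaced-token-detection example.

    Returns (corrupted_tokens, labels) with labels[i] == 1 when the token
    at position i differs from the original and 0 otherwise.
    """
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < replace_prob:
            sampled = rng.choice(vocab)  # stand-in for the generator's guess
            corrupted.append(sampled)
            # A sample that happens to equal the original counts as original.
            labels.append(int(sampled != tok))
        else:
            corrupted.append(tok)
            labels.append(0)
    return corrupted, labels

tokens = "the chef cooked the meal".split()
vocab = ["the", "chef", "cooked", "ate", "meal", "dog"]
corrupted, labels = corrupt(tokens, vocab, replace_prob=0.3, seed=2)
print(corrupted)
print(labels)
```

Every position carries a label, so the discriminator gets feedback on all tokens in the sequence rather than only the replaced ones.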
Areas of application
ELECTRA can be used for sentiment analysis, which involves identifying the emotional tone of a text.
Because it learns from every token in the input, both replaced and original, ELECTRA can be used to build more accurate sentiment analysis models that better capture the subtleties of language and deliver more meaningful insights.
7. T5
T5, or Text-to-Text Transfer Transformer, is a language model developed by Google AI. It is designed to handle different natural language processing tasks by framing each one as converting input text into output text.
How does it work?
T5 is built on the Transformer architecture and was pre-trained using unsupervised learning on a large amount of text data. Unlike earlier language models, T5 is trained on a variety of tasks, including language understanding, question answering, summarization, and translation, all cast in the same text-to-text format.
This lets T5 perform many tasks by fine-tuning the model on small amounts of task-specific input.
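The text-to-text framing is easiest to see in how inputs are written. The sketch below builds input strings with task prefixes in the style described in the T5 paper (`translate English to German:`, `summarize:`, `question: ... context: ...`). The helper `to_text_to_text` and its argument names are hypothetical, and it only formats strings; it does not call any model.

```python
def to_text_to_text(task, **fields):
    """Format a task as a single input string with a task prefix."""
    if task == "translate":
        return f"translate {fields['source']} to {fields['target']}: {fields['text']}"
    if task == "summarize":
        return f"summarize: {fields['text']}"
    if task == "qa":
        return f"question: {fields['question']} context: {fields['context']}"
    raise ValueError(f"unknown task: {task}")

examples = [
    to_text_to_text("translate", source="English", target="German",
                    text="That is good."),
    to_text_to_text("summarize", text="NLP models have grown rapidly in size."),
    to_text_to_text("qa", question="Who released T5?",
                    context="T5 was released by Google."),
]
for line in examples:
    print(line)
```

Since every task becomes a string in and a string out, one model, one loss function, and one decoding procedure can serve all of them.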
Where is T5 used?
T5 has several potential applications in natural language processing. It can be used to build chatbots, virtual assistants, and other conversational AI systems that understand and respond to natural-language input. T5 can also be used for tasks such as language translation, summarization, and text completion.
T5 has been open-sourced by Google and widely adopted by the NLP community for applications such as text classification, question answering, and machine translation.
8. PaLM
PaLM (Pathways Language Model) is an advanced language model developed by Google AI. It aims to improve the performance of natural language processing models to meet the growing demand for complex language tasks.
How does it work?
Like other popular language models such as BERT and GPT, PaLM is a Transformer-based model. However, its design and training approach set it apart from other models.
To improve performance and generalization, PaLM is trained with Google's Pathways system, which scales training across a large number of accelerator chips and is aimed at a single model that can handle many tasks at once.
Where do we use PaLM?
PaLM can be used for a variety of NLP tasks, especially those that require a deep understanding of natural language. It is useful for sentiment analysis, question answering, language modeling, machine translation, and many other things.
It can also be integrated into systems and tools such as chatbots, virtual assistants, and voice recognition systems to improve their language processing capabilities.
Overall, PaLM is a promising technology with a wide range of possible applications thanks to its capacity to extend language processing capabilities.
Conclusion
In closing, natural language processing (NLP) has changed the way we interact with technology, allowing us to communicate with machines in a human-like way.
NLP has become more accurate and more efficient than ever thanks to recent advances in machine learning, particularly the development of large language models such as GPT-4, RoBERTa, XLNet, ELECTRA, and PaLM.
As NLP advances, we can expect to see even more powerful and sophisticated language models emerge, with the potential to change how we interact with technology, how we communicate with one another, and how we understand the complexity of human language.