Ukuhlakanipha okungekhona okwangempela iguqula indlela esihlela ngayo futhi sikhiqize okuqukethwe. Kuphinde kuthinte indlela abantu abathola ngayo izinto ezibonakalayo, kusukela kulokho abakuseshayo ku-Google kuya kulokho abakubuka kakhulu ku-Netflix.
Okubaluleke nakakhulu, kubakhangisi bokuqukethwe, kwenza amaqembu akhule ngokwenza ezinye izinhlobo zokukhiqiza okuqukethwe ngokuzenzakalela nokuhlaziya izinto zamanje ukuze kuthuthukiswe okulethwayo kanye nokufanisa kangcono inhloso yamakhasimende.
Kunezicucu eziningana ezihambayo ku-AI kanye ukufunda imishini izinqubo. Wake wabuza umsizi ohlakaniphile (ofana no-Siri noma i-Alexa) umbuzo?
Impendulo kungenzeka ukuthi “yebo,” okuphakamisa ukuthi usujwayelene nokucutshungulwa kolimi lwemvelo ezingeni elithile (NLP).
U-Alan Turing yigama wonke ama-techie azwile ngalo. I-Turing Test eyaziwa kakhulu yaqala ukuklanywa ngo-1950 isazi sezibalo esaziwayo nososayensi wekhompyutha u-Alan Turing.
Wavuma emsebenzini wakhe Imishini Yekhompyutha Nobuhlakani ukuthi umshini uhlakaniphile uma ukwazi ukuxoxa nomuntu futhi umkhohlise acabange ukuthi uxoxa nomuntu.
Lokhu kusebenze njengesisekelo sobuchwepheshe be-NLP. Isistimu ye-NLP esebenza kahle izokwazi ukubamba umbuzo nomongo wawo, iwuhlaziye, ikhethe inkambo engcono kakhulu yokwenza, futhi iphendule ngolimi umsebenzisi azoluqonda.
Amazinga omhlaba wonke okuqedela imisebenzi kudatha ahlanganisa ubuhlakani bokwenziwa nezindlela zokufunda zomshini. Nokho, kuthiwani ngolimi lwabantu?
Izinkambu zesizukulwane solimi lwemvelo (NLG), ukuqonda kolimi lwemvelo (NLU), kanye nokucubungula ulimi lwemvelo (NLP) zonke zithole ukunakwa okukhulu eminyakeni yamuva nje.
Kodwa ngenxa yokuthi laba abathathu banemithwalo yemfanelo ehlukene, kubalulekile ukugwema ukudideka. Abaningi bakholelwa ukuthi bayayiqonda le mibono iyonke.
Njengoba ulimi lwemvelo seluvele lukhona emagameni, konke umuntu akwenzayo wukucubungula, ukuqondisisa, nokulukhiqiza. Sinqume ukuthi kungase kusize ukujula kancane, noma kunjalo, njengoba sibhekana kaningi kangakanani nala mabinzana asetshenziswa ngokushintshana.
Ngakho-ke, ake siqale ngokubhekisisa ngayinye yazo.
Kuyini Ukucutshungulwa Kolimi Lwemvelo?
Noma yiluphi ulimi lwemvelo lubhekwa njengombhalo okhululekile ngamakhompyutha. Lokhu kulandela ukuthi ngenkathi ufaka idatha, awekho amagama angukhiye angaguquki ezindaweni ezigxilile. Ngaphezu kokungahlelwanga, ulimi lwemvelo lubuye lube nezinhlobonhlobo zezinketho zokukhuluma. Thatha le mishwana emithathu njengomfanekiso:
- Isimo sezulu sinjani namuhla?
- Ingabe namuhla inalo ithuba lokuna kwemvula?
- Ingabe namuhla idinga ukuba ngilethe isambulela sami?
Zonke lezi zitatimende zibuza ngesibikezelo sezulu sanamuhla, okuyinani elivamile.
Njengabantu, singabona ngokushesha lokhu kuxhumana okuyisisekelo futhi senze ngendlela efanele.
Noma kunjalo, lokhu kuyi- inselele kumakhompyutha njengoba yonke i-algorithm idinga okokufaka ukuze kulandele ifomethi ethile, futhi zontathu izitatimende zinezakhiwo namafomethi ahlukene.
Futhi izinto zizoba nzima maduzane uma sizama ukuhlanganisa imithetho yenhlanganisela yegama ngalinye kuzo zonke izilimi zemvelo ukuze sisize ikhompuyutha ekuqondeni. I-NLP ingena esithombeni kulesi simo.
Ukucubungula ulimi lwemvelo (NLP), oluzama uku imodeli yolimi lomuntu lwemvelo idatha, evela kulimi lwekhompyutha.
Ukwengeza, i-NLP igxile ekusebenziseni umshini wokufunda nezindlela zokufunda ezijulile kuyilapho icubungula inani elibalulekile lokokufaka komuntu. Ivame ukusetshenziswa kufilosofi, izilimi, isayensi yekhompyutha, izinhlelo zolwazi, kanye nezokuxhumana.
Ulimi lwekhompyutha, ukuhlaziya i-syntax, ukubonwa kwenkulumo, ukuhumusha ngomshini, nezinye izigatshana ze-NLP ezimbalwa kuphela. Ukucutshungulwa kolimi lwemvelo kuguqula izinto ezingahlelekile zibe yifomethi efanele noma umbhalo ohlelekile ukuze usebenze.
Ukuze uqonde ukuthi umsebenzisi usho ukuthini uma esho noma yini, yakha i-algorithm futhi iqeqeshe imodeli isebenzisa ubuningi bedatha.
Isebenza ngokuhlanganisa amabhizinisi ahlukene ukuze ahlonzwe (okwaziwa ngokuthi ukuqashelwa kwebhizinisi) nangokubona amaphethini wamagama. I-Lemmatization, i-tokenization, kanye namasu okunciphisa asetshenziselwa ukuthola amaphethini wamagama.
Ukukhishwa kolwazi, ukuqashelwa kwezwi, ukumaka ingxenye yenkulumo, nokwahlukanisa eminye yemisebenzi eyenziwa yi-NLP.
Emhlabeni wangempela, i-NLP isetshenziselwa imisebenzi ehlanganisa i-ontology populating, imodeli yolimi, ukuhlaziywa kwemizwa, ukukhishwa kwesihloko, ukuqashelwa kwebhizinisi okunegama, ukumaka izingxenye zenkulumo, ukukhipha koxhumano, ukuhumusha ngomshini, nokuphendula imibuzo ezenzakalelayo.
Kuyini Ukuqonda Ulimi Lwemvelo?
Ingxenye encane yokucubungula ulimi lwemvelo ukuqonda kolimi lwemvelo. Ngemva kokuba ulimi selwenziwe lwaba lula, isofthiwe yekhompiyutha kufanele iqonde, ithole incazelo, futhi ngokunokwenzeka ihlole imizwa.
Umbhalo ofanayo ungaba nezincazelo eziningana, imishwana eminingana ingaba nencazelo efanayo, noma incazelo ingashintsha kuye ngesimo.
Ama-algorithms e-NLU asebenzisa izindlela zokubala ukuze acubungule umbhalo ovela emithonjeni eminingi ukuze kuqondwe umbhalo wokufakwayo, okungaba okuyisisekelo njengokwazi ukuthi ibinzana lisho ukuthini noma kube nzima njengokuhumusha ingxoxo phakathi kwabantu ababili.
Umbhalo wakho uguqulwa ube yifomethi efundeka ngomshini. Ngenxa yalokho, i-NLU isebenzisa amasu okuhlanganisa ukuze ichaze umbhalo futhi ikhiqize umphumela.
I-NLU ingasetshenziswa ezimweni ezihlukahlukene, njengokuqonda ingxoxo phakathi kwabantu ababili, ukunquma ukuthi umuntu uzizwa kanjani ngesimo esithile, nezinye izimo zemvelo efanayo.
Ikakhulukazi, kunamazinga amane olimi okufanele abambe i-NLU:
- I-syntax: Lena inqubo yokunquma ukuthi uhlelo lolimi lusetshenziswa ngendlela efanele yini nokuthi imisho ihlanganiswa kanjani. Ngokwesibonelo, umongo womusho nohlelo lolimi kufanele kucatshangelwe ukuze kunqunywe ukuthi kunengqondo yini.
- I-Semantics: Uma sihlola umbhalo, ama-nuances encazelo yomongo afana ne-tenor yesenzo noma ukukhetha kwamagama phakathi kwabantu ababili akhona. Lezi zingcezu zolwazi zingaphinda ziqashwe i-algorithm ye-NLU ukunikeza imiphumela evela kunoma yisiphi isimo lapho kungasetshenziswa khona igama elifanayo elikhulunywayo.
- Ukuphikiswa komqondo wamagama: Kuyinqubo yokuthola ukuthi igama ngalinye emshweni lisho ukuthini. Kuye ngomongo, inikeza igama incazelo yalo.
- Ukuhlaziywa kwe-Pragmatic: Kuyasiza ekuqondeni ukulungiselelwa nenjongo yomsebenzi.
I-NLU ibalulekile ku ososayensi bemininingwane ngoba, ngaphandle kwayo, abanalo ikhono lokukhipha incazelo kubuchwepheshe obufana ne-chatbots nesofthiwe yokuqaphela inkulumo.
Phela, abantu bajwayele ukuba nengxoxo ne-bot ekwazi ukukhuluma; amakhompyutha, ngakolunye uhlangothi, awanakho lokhu okunethezeka kokukhululeka.
Ngaphezu kwalokho, i-NLU ingabona imizwa nenhlamba enkulumweni ngendlela ongakwazi ngayo. Lokhu kusho ukuthi ososayensi bedatha bangahlola ngokusebenzisekayo amafomethi ahlukahlukene okuqukethwe futhi bahlukanise umbhalo besebenzisa amakhono e-NLU.
I-NLG isebenza ngokuphikisana ngokuqondile nokuqonda kolimi lwemvelo, okuhloswe ngayo ukuhlela nokwenza umqondo wedatha engahlelekile ukuze iguqulelwe kudatha esebenzisekayo. Okulandelayo, ake sichaze i-NLG futhi sihlole izindlela ososayensi bedatha abayisebenzisa ngayo ezimweni ezingokoqobo zokusebenzisa.
Siyini Isizukulwane Solimi Lwemvelo?
Ukucutshungulwa kolimi lwemvelo kuhlanganisa nokukhiqizwa kolimi lwemvelo. Amakhompiyutha angabhala esebenzisa ukukhiqizwa kolimi lwemvelo, kodwa ukuqonda ulimi lwemvelo kugxile ekufundeni ngokuqondisisa.
Ngokusebenzisa okokufaka kwedatha ethile, i-NLG idala impendulo ebhaliwe ngolimi lwabantu. Amasevisi ombhalo-kuya-enkulumweni ingasetshenziswa futhi ukuguqula lo mbhalo ube yinkulumo.
Lapho ososayensi bedatha behlinzeka ngohlelo lwe-NLG ngedatha, uhlelo luhlaziya idatha ukuze lukhiqize ukulandisa okungaqondwa ngengxoxo.
Empeleni, i-NLG iguqulela amasethi edatha olimini esiluqondayo sobabili, olubizwa ngokuthi ulimi lwemvelo. Ukuze inikeze okuphumayo okucutshungulwe ngokucophelela futhi okunembe ngezinga eliphezulu elingenzeka, i-NLG inikezwe ulwazi lomuntu ophilayo ngempela.
Le ndlela, engalandelelwa emuva kweminye yemibhalo ka-Alan Turing esesixoxile ngayo, ibalulekile ekukholiseni abantu ukuthi ikhompuyutha ixoxa nabo ngendlela ezwakalayo nengokwemvelo, kungakhathaliseki ukuthi yisiphi isihloko.
I-NLG ingasetshenziswa izinhlangano ukukhiqiza ukulandisa kwengxoxo okungasetshenziswa yiwo wonke umuntu ongaphakathi enkampanini.
I-NLG, evame ukusetshenziselwa amadeshibhodi obuhlakani bebhizinisi, ukukhiqizwa kokuqukethwe okuzenzakalelayo, nokuhlaziywa kwedatha okusebenzayo, kungaba usizo olukhulu kochwepheshe abasebenza ezigabeni ezinjengokuthengisa, izinsiza zabantu, ukuthengisa, kanye nobuchwepheshe bolwazi.
Iyiphi indima edlalwa yi-NLU ne-NGL ku-NLP?
I-NLP ingasetshenziswa ososayensi bedatha kanye ukuhlakanipha okungekhona okwangempela ochwepheshe ukuze baguqule amasethi edatha angahlelekile abe amafomu amakhompyutha angakwazi ukuwahumushela enkulumweni nasembhalweni - bangakha ngisho nezimpendulo ezifanele ngokwengqikithi yombuzo obabuzayo (cabanga futhi ngabasizi ababonakalayo abafana no-Siri ne-Alexa).
Kepha i-NLU ne-NLG ingena kuphi ku-NLP?
Noma zonke zidlala indima ehlukene, yomithathu le mikhakha inento eyodwa efana ngayo: yonke ikhuluma ngolimi lwemvelo. Ngakho, uyini umehluko phakathi kwalaba abathathu?
Kucabange ngale ndlela: kuyilapho i-NLU ihlose ukuqonda ulimi olusetshenziswa abantu, i-NLP ihlonza idatha ebaluleke kakhulu futhi iyihlele ibe izinto ezifana nombhalo nezinombolo.
Ingasiza ngisho nokuxhumana okubethelwe okuyingozi. I-NLG, ngakolunye uhlangothi, isebenzisa amaqoqo edatha engahlelekile ukukhiqiza izindaba esingazihumusha njengezinenjongo.
Ikusasa le-NLP
Nakuba i-NLP inokusetshenziswa okuningi kwamanje kwezentengiselwano, amabhizinisi amaningi akuthole kunzima ukuyamukela kabanzi.
Lokhu ikakhulukazi kungenxa yalezi zinkinga ezilandelayo: Inkinga eyodwa evame ukuthinta izinhlangano ukugcwala kolwazi, okwenza kube inselele kuzo ukukhomba ukuthi yimaphi amasethi wedatha abalulekile phakathi nolwandle olungapheli lwedatha eyengeziwe.
Ukwengeza, ukuze kusetshenziswe i-NLP ngempumelelo, izinhlangano zivame ukudinga izindlela ezithile nemishini ezivumela ukuthi zikhiphe ulwazi olubalulekile kudatha.
Okokugcina, i-NLP isho ukuthi izinkampani zidinga imishini esezingeni eliphezulu uma zifisa ukuphatha nokugcina ukuqoqwa kwedatha evela emithonjeni ehlukahlukene yedatha esebenzisa i-NLP.
Naphezu kwezithiyo ezigcina inqwaba yamafemu ekwamukeleni i-NLP, kubonakala sengathi lezi zinhlangano zizogcina zamukele i-NLP, i-NLU, ne-NLG ukuze amarobhothi azo akwazi ukusekela ukusebenzelana nezingxoxo ezingokoqobo, ezifana nezomuntu.
I-Semantics ne-syntax yimikhakha engaphansi ye-NLP yocwaningo ethola ukunakwa okukhulu.
Isiphetho
Ukucabangela lokho esesikuxoxile kuze kube manje: Ukunikeza incazelo ezwini nasekubhaleni, i-NLU ifunda futhi iqonde ulimi lwemvelo, futhi i-NLG ithuthukisa futhi ikhiphe ulimi olusha ngosizo lwemishini.
Ulimi lusetshenziswa yi-NLU ukuze kukhishwe amaqiniso, kuyilapho i-NLG isebenzisa imininingwane etholwe yi-NLU ukukhiqiza ulimi lwemvelo.
Qaphela abadlali abakhulu embonini ye-IT njenge-Apple, Google, ne-Amazon ukuthi baqhubeke nokutshala imali ku-NLP ukuze bakwazi ukuthuthukisa izinhlelo abalingisa ukuziphatha komuntu.
shiya impendulo