Isiqulatho[Fihla][Bonisa]
Enye yeendlela eziphambili kulo naluphi na uhlobo lomsebenzi weshishini kusetyenziso olusebenzayo lolwazi. Ngexesha elithile, umthamo wedatha owenziweyo udlula umthamo wokulungiswa okusisiseko.
Kulapho kudlala khona ii-algorithms zokufunda koomatshini. Nangona kunjalo, ngaphambi kokuba nayiphi na kwezi zinto zenzeke, ulwazi kufuneka lufundwe kwaye lutolikwe. Ngamafutshane, yinto yokufunda koomatshini engajongwanga esetyenziselwa yona.
Kweli nqaku, siza kuphonononga nzulu umatshini wokufunda ongajongwanga, kubandakanywa algorithms, iimeko zokusetyenziswa, kunye nokunye okuninzi.
Yintoni ukuFunda koomatshini okungagadwanga?
Iindlela zokufunda zoomatshini ezingajongwanga zichonga iipateni kwiseti yedatha engenazo iziphumo ezaziwayo okanye ezilebhile. Ijongwe umatshini wokufunda iialgorithms ube nemveliso ephawulweyo.
Ukwazi lo mahluko kukunceda uqonde ukuba kutheni iindlela zokufunda zoomatshini ezingajongwanga zingasetyenziselwa ukucombulula uhlengahlengiso okanye imiba yokuhlela, ekubeni ungazi ukuba yintoni ixabiso/impendulo yedatha ephumayo. Awukwazi ukuqeqesha i-algorithm ngokuqhelekileyo ukuba awulazi ixabiso / impendulo.
Ngaphezu koko, ukufunda okungajongwanga kungasetyenziselwa ukuchonga ubume bedatha obusisiseko. Ezi algorithms zifumanisa iipateni ezifihliweyo okanye amaqela edatha ngaphandle kwesidingo sokusebenzisana kwabantu.
Amandla ayo okubona ukufana kunye nokuchasana kolwazi kwenza kube lukhetho olukhulu lokuhlalutya idatha yokuhlola, iindlela zokuthengisa ezinqamlezayo, ukwahlula kwabathengi, kunye nokuchongwa kwemifanekiso.
Qwalasela le meko ilandelayo: ukwivenkile ethengisa ukutya kwaye ubona isiqhamo esingaziwayo ongazange wasibona ngaphambili. Unokwahlula ngokulula isiqhamo esingaziwayo esahlukileyo kwezinye iziqhamo ezijikelezileyo ngokusekelwe kuqwalaselo lwakho lwemo, ubungakanani, okanye umbala.
Ii-algorithms zokuFunda koomatshini ezingagadwanga
Clustering
Ukuhlanganisana ngaphandle kwamathandabuzo yeyona ndlela isetyenziswayo yokufunda engajongwanga. Le ndlela ibeka izinto ezihambelanayo zedatha kumaqela enziwe ngokungenamkhethe.
Ngokwayo, imodeli ye-ML ifumanisa naziphi na iipateni, ukufana, kunye/okanye iiyantlukwano kulwakhiwo lwedatha olungahlelwanga. Imodeli iya kukwazi ukufumana nawaphi na amaqela endalo okanye iiklasi kwidatha.
iintlobo
Kukho iindlela ezininzi zokudibanisa ezinokuthi zisetyenziswe. Makhe sijonge kwezona zibalulekileyo kuqala.
- Ukudibanisa okukodwa, ngamanye amaxesha okwaziwa njengokuhlanganisa "okuqinileyo", luhlobo lokudibanisa apho iqhekeza elinye ledatha likwiqela elinye.
- Ukudityaniswa okuthe kratya, okukholisa ukwaziwa njengokuhlanganisana “okuthambileyo”, kuvumela izinto zedatha ukuba zezeqela elingaphezulu kwesinye ukuya kumaqondo ahlukeneyo. Ngaphaya koko, ukuhlanganisana okunokwenzeka kunokusetyenziselwa ukujongana “neengxaki ezithambileyo” zentlanganisela okanye iingxaki zoqikelelo loxinaniso, kunye nokuvavanya ukuba nokwenzeka okanye ukuba nokwenzeka kwamanqaku edatha yamaqela athile.
- Ukudala uluhlu lwezinto eziqokelelweyo zedatha yinjongo yokuhlanganisana ngokwemigangatho, njengoko igama libonisa. Izinto zedatha ziyalungiswa okanye zidityaniswe ngokusekwe kuluhlu lwemigangatho ukuze kuveliswe amaqela.
Sebenzisa iimeko:
- Ukufunyaniswa okungaqhelekanga:
Naluphi na uhlobo lwangaphandle kwidatha lunokubonwa ngokusebenzisa i-clustering. Iinkampani kwezothutho kunye nolungiselelo, umzekelo, zinokusebenzisa ubhaqo olungaqhelekanga ukufumanisa izithintelo zolungiselelo okanye ukuxela iindawo ezonakeleyo zoomatshini (ugcino oluqikelelweyo).
Amaziko emali angasebenzisa iteknoloji ukuze abone ukuthengiselana ngobuqhetseba kwaye aphendule ngokukhawuleza, oko kunokonga imali eninzi. Funda ngakumbi malunga nokubona izinto ezingaqhelekanga kunye nobuqhetseba ngokubukela ividiyo yethu.
- Ukwahlulwa kwabathengi kunye neemarike:
Ukuhlanganiswa kwe-algorithms kunokuncedisa ekuhlanganiseni abantu abaneempawu ezifanayo kunye nokudala abantu abathengi ukuze bathengiswe ngempumelelo kunye namanyathelo ekujoliswe kuwo.
K-Iindlela
I-K-indlela yindlela yokudibanisa eyaziwa ngokuba ngukwahlula okanye ukwahlula. Yahlula amanqaku edatha kwinani elimiselweyo lamaqela aziwa ngokuba yi-K.
Kwindlela ye-K, i K ligalelo njengoko uxelela ikhomputha ukuba mangaphi amaqela ofuna ukuwachonga kwidata yakho. Into nganye yedatha inikezelwa kwiziko leqela elikufutshane, elaziwa ngokuba yi-centroid (amachaphaza amnyama emfanekisweni).
Le yokugqibela isebenza njengeendawo zokugcina idatha. Ubuchule bokudibanisa bunokwenziwa ngamaxesha amaninzi de amaqela achazwe kakuhle.
UFuzzy K-indlela
I-Fuzzy K-indlela yokwandiswa kobuchule beendlela ze-K, ezisetyenziselwa ukwenza ukuhlangana okudityanisiweyo. Ngokungafaniyo nobuchule beendlela ze-K, ii-K-indlela ezingaqondakaliyo zibonisa ukuba amanqaku edatha anokuba ngamaqela amaninzi anemigangatho eyahluka-hlukeneyo yokusondelelana kwelinye.
Umgama phakathi kwamanqaku edatha kunye necentroid yeqela lisetyenziselwa ukubala ukusondela. Ngenxa yoko, kunokubakho amaxesha apho amaqela ahlukeneyo adibanayo.
Iimodeli zoMxube weGaussian
IiModeli zoMxube weGaussian (GMMs) yindlela esetyenziswa ekuhlanganiseni okunokwenzeka. Ngenxa yokuba intsingiselo kunye nokwahluka kungaziwa, iimodeli zicinga ukuba kukho inani elimiselweyo losasazo lwe-Gaussian, nganye imele iqela elahlukileyo.
Ukumisela ukuba yeyiphi iqoqo indawo yedatha ethile, indlela isetyenziswa ngokusisiseko.
Iqela le-Hierarchical Clustering
Isicwangciso soluhlu lwemigangatho sinokuqala ngendawo nganye yedatha eyabelwe iqela elahlukileyo. Amaqela amabini asondelelene kwelinye emva koko adityaniswa abe liqela elinye. Ukudityaniswa okuphindiweyo kuyaqhubeka kude kube liqela elinye kuphela eliseleyo phezulu.
Le ndlela yaziwa ngokuba yi-bottom-up okanye i-agglomerative. Ukuba uqala ngayo yonke imiba yedatha ebotshelelwe kwiqela elinye kwaye emva koko uqhube ukwahlula de into nganye yedatha yabelwe njengeqela elahlukileyo, indlela yaziwa ngokuba yi-top-down okanye i-hierarchical clustering eyahlulayo.
I-algorithm ye-Apriori
Uhlalutyo lwebhasikithi yentengiso lwandise ii-algorithms ze-apriori, ezikhokelela kwiinjine ezahlukeneyo zeengcebiso zamaqonga omculo kunye neevenkile ze-intanethi.
Ziyasetyenziswa kwiiseti zentengiselwano ukufumana izinto ezihlala rhoqo, okanye amaqela ezinto, ukuze kuqikelelwe ukuba kunokwenzeka ukuba kusetyenziswe imveliso enye ngokusekwe kusetyenziso kwenye.
Umzekelo, ukuba ndiqala ukudlala unomathotholo we-OneRiphabhlikhi kwiSpotify kunye "nokubala iinkwenkwezi," enye yeengoma ezikwesi jelo ngokuqinisekileyo iya kuba yingoma ye-Imagine Dragon, efana "noBad Liar."
Oku kusekelwe kwimikhwa yam yangaphambili yokuphulaphula kwakunye neendlela zokuphulaphula zabanye. Iindlela ze-Apriori zibala izinto kusetyenziswa umthi we-hash, ukunqumla i-dataset ububanzi-kuqala.
UkuNcitshiswa kobungakanani
Ukunciphisa ubungakanani luhlobo lokufunda olungagadwanga olusebenzisa ingqokelela yezicwangciso zokunciphisa inani leempawu - okanye imilinganiselo - kwidathasethi. Sivumele ukuba sicacise.
Kungahenda ukubandakanya idatha eninzi kangangoko kunokwenzeka ngelixa usenza eyakho iseti yedatha yokufunda koomatshini. Ungasiphazamisi: esi sicwangciso sisebenza kakuhle kuba idatha eninzi ihlala inika iziphumo ezichanekileyo.
Cinga ukuba idatha igcinwe kwindawo ye-N-dimensional, kunye nenqaku ngalinye elimele i-dimensional eyahlukileyo. Kunokubakho amakhulu emilinganiselo ukuba kukho idatha eninzi.
Qwalasela i-Excel spreadsheets, enekholamu emele iimpawu kunye nemigca emele izinto zedatha. Xa kukho imilinganiselo emininzi kakhulu, ii-algorithms zeML zinokusebenza kakubi kwaye ukubonwa kwedatha kunokuba nzima.
Ke yenza kube sengqiqweni ukunciphisa iimpawu okanye imilinganiselo, kwaye udlulise ulwazi olufanelekileyo. Ukuncitshiswa kweDimensionality kunjalo nje. Ivumela ubungakanani obulawulekayo bamagalelo edatha ngaphandle kokubeka esichengeni ukuthembeka kweseti yedatha.
Uhlalutyo lweCandelo eliyiNtloko (PCA)
Uhlalutyo lwecandelo eliphambili yindlela yokunciphisa i-dimensionality. Isetyenziselwa ukunciphisa inani leempawu kwiiseti zedatha ezinkulu, okukhokelela kubulula obukhulu bedatha ngaphandle kokuncama ukuchaneka.
Uxinzelelo lweseti yedatha lwenziwa ngendlela eyaziwa ngokuba yisici sokutsalwa. Ibonisa ukuba izinto ezisuka kwiseti yokuqala zidityaniswa zibe yintsha, encinci. Ezi mpawu zintsha zaziwa njengeenxalenye eziphambili.
Ewe kunjalo, kukho ii-algorithms ezongezelelweyo onokuzisebenzisa kwizicelo zakho zokufunda ezingajongwanga. Ezi zidweliswe ngasentla zezona zixhaphakileyo, yiyo loo nto zixutyushwa ngokweenkcukacha.
Ukusetyenziswa kokufunda okungajongwanga
- Iindlela zokufunda ezingajongwanga zisetyenziselwa imisebenzi yembono yokubona efana nokuqaphela into.
- Ukufundwa komatshini okungajongwanga kunika imiba ebalulekileyo kwiinkqubo zokucinga zonyango, ezifana nokuchongwa kwemifanekiso, ukuhlelwa, kunye nokwahlulahlula, ezisetyenziswa kwi-radiology kunye ne-pathology ukuxilonga izigulane ngokukhawuleza nangokuthembekileyo.
- Ukufunda okungajongwanga kunokuncedisa ekuchongeni iintsingiselo zedatha ezinokuthi zisetyenziswe ukwenza izicwangciso ezisebenzayo zokuthengisa ezinqamlezayo kusetyenziswa idatha yexesha elidlulileyo kwindlela yokuziphatha kwabathengi. Ngexesha lenkqubo yokuphuma, oku kusetyenziswa ngamashishini e-intanethi ukucebisa izongezo ezifanelekileyo kubathengi.
- Iindlela zokufunda ezingajongwanga zinokuhluza imithamo emikhulu yedatha ukufumana abangaphandle. Oku kungahambi kakuhle kunokuphakamisa isaziso sezixhobo ezingasebenzi kakuhle, impazamo yomntu, okanye ukophulwa kokhuseleko.
Imiba yokufunda okungajongwanga
Ukufunda okungajongwanga kuyathandeka ngeendlela ezahlukeneyo, ukusuka ekufumaneni ulwazi olubalulekileyo idatha ekuthinteleni ukuleyibhelishwa kwedatha okuxabisa kakhulu imisebenzi. Nangona kunjalo, kukho iingxaki ezininzi ekusebenziseni esi sicwangciso sokuqeqesha iimodeli zokufunda ngomatshini ukuba kufuneka uyazi. Nantsi eminye imizekelo.
- Njengoko idatha yegalelo ingenazo iilebhile ezisebenza njengezitshixo zokuphendula, iziphumo zemifuziselo yokufunda ezingajongwanga zingachaneka kangako.
- Ukufunda okungajongwanga rhoqo kusebenza ngeeseti zedatha ezinkulu, ezinokunyusa ukuntsokotha kwezibalo.
- Le ndlela ifuna ukuqinisekiswa kwemveliso ngabantu, nokuba ziingcali zangaphakathi okanye zangaphandle kumbandela wophando.
- I-algorithms kufuneka ivavanye kwaye ibale yonke imeko enokwenzeka kwisigaba sonke soqeqesho, esithatha ixesha elithile.
isiphelo
Ukusetyenziswa kwedatha okusebenzayo ngundoqo ekusekeni umda wokhuphiswano kwimarike ethile.
Unokwahlula idatha usebenzisa i-algorithms yokufunda koomatshini ongajongwanga ukujonga izinto ozikhethayo kubaphulaphuli ojolise kubo okanye ukujonga indlela usulelo oluthile oluphendula ngayo kunyango oluthile.
Zininzi izicelo ezisebenzayo, kwaye izazinzulu zedatha, iinjineli, kunye nabayili bezakhiwo banokukunceda ekuchazeni iinjongo zakho kunye nokuphuhlisa izisombululo ezizodwa zeML zenkampani yakho.
Shiya iMpendulo