Researchers and data scientists often run into situations where they either have no real data or cannot use it because of confidentiality or privacy considerations.
To address this problem, synthetic data generation is used to produce a substitute for real data.
For an algorithm to work properly, the substitute must be suitable for the task and realistic in character. You can use such data to preserve privacy, to test systems, or to produce training data for machine learning algorithms.
Let's explore synthetic data generation in detail and see why it matters in the age of AI.
What is Synthetic Data?
Synthetic data is annotated data generated by computer simulations or algorithms as a stand-in for real-world information. It is an artificially created replica of actual data.
AI algorithms can learn the patterns and dimensions of a dataset. Once trained, they can create an unlimited amount of synthetic data that is representative of the original training data.
There are various approaches and technologies that can help us create synthetic data, and it can be used in a wide range of applications.
Data generation software usually requires:
- Metadata of the data store for which the synthetic data must be created.
- A technique for generating plausible but fictional values. Examples include value lists and regular expressions.
- Comprehensive knowledge of all data relationships, both those declared at the database level and those enforced in application code.
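The three requirements above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the table layout, the value lists, the phone format, and every function name are hypothetical and do not come from any particular tool.

```python
import random
import re

# Hypothetical metadata for two related tables; every name here is illustrative.
CITIES = ["Cape Town", "Durban", "Gqeberha"]          # value list
FIRST_NAMES = ["Ayanda", "Thabo", "Lerato", "Sipho"]  # value list
PHONE_PATTERN = re.compile(r"^0\d{2}-\d{3}-\d{4}$")   # format fake phones must match


def fake_phone(rng):
    """Build a phone number from random digits so that it matches PHONE_PATTERN."""
    digits = "".join(str(rng.randint(0, 9)) for _ in range(9))
    return f"0{digits[:2]}-{digits[2:5]}-{digits[5:]}"


def make_customers(n, rng):
    """Parent table: plausible but fictional customers."""
    return [
        {"customer_id": i, "name": rng.choice(FIRST_NAMES),
         "city": rng.choice(CITIES), "phone": fake_phone(rng)}
        for i in range(1, n + 1)
    ]


def make_orders(n, customers, rng):
    """Child table: every order points at an existing customer (foreign key)."""
    ids = [c["customer_id"] for c in customers]
    return [
        {"order_id": i, "customer_id": rng.choice(ids),
         "amount": round(rng.uniform(5, 500), 2)}
        for i in range(1, n + 1)
    ]


rng = random.Random(42)  # fixed seed so the fake data is reproducible
customers = make_customers(5, rng)
orders = make_orders(10, customers, rng)
```

Generating the parent table first and drawing foreign keys only from it is what keeps the relationship intact; a relationship that lives only in application code would need an equivalent rule written by hand.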
It is equally necessary to validate the model and to compare the behavior of the real data with that of the data the model generates.
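One simple way to make such a comparison concrete is a two-sample Kolmogorov-Smirnov statistic, which measures the largest gap between two empirical distributions. The sketch below is an assumption about how a check might look, not a method named in this article; the "real" series is itself simulated so that the example is self-contained.

```python
import random
import statistics


def ks_statistic(a, b):
    """Two-sample Kolmogorov-Smirnov statistic: the largest gap between the
    empirical CDFs of a and b (0 = identical samples, 1 = fully separated)."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    gap = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        # Advance both pointers past every value <= x, then compare the CDFs at x.
        while i < len(a) and a[i] <= x:
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        gap = max(gap, abs(i / len(a) - j / len(b)))
    return gap


rng = random.Random(0)
real = [rng.gauss(50, 10) for _ in range(2000)]        # stand-in for real data
good_synth = [rng.gauss(50, 10) for _ in range(2000)]  # same distribution
bad_synth = [rng.gauss(60, 10) for _ in range(2000)]   # shifted mean

report = {
    "real_mean": statistics.mean(real),
    "good_synth_mean": statistics.mean(good_synth),
    "good_ks": ks_statistic(real, good_synth),
    "bad_ks": ks_statistic(real, bad_synth),
}
```

A small statistic suggests the synthetic feature behaves like the real one; a large one, as for the shifted series here, flags a generator that needs another look. A check like this covers one feature at a time and says nothing about relationships between features.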
These artificial datasets aim to keep the value of the real thing while containing none of the sensitive data. It is like a delicious cake with no calories. Done well, the data mirrors the real world closely.
As a result, you can use it in place of real-world data.
Importance of Synthetic Data
Synthetic data can be shaped to fit specific needs or conditions that may not be available in real-world data. When test data is scarce or when privacy is a top priority, it comes to the rescue.
AI-generated datasets are flexible, secure, and easy to store, exchange, and discard. Data synthesis is also suitable for subsetting and enriching the original data.
As a result, it is well suited for use as test data and as AI training data, for example:
- Training the machine-learning models behind Uber's and Tesla's self-driving cars.
- In the medical and healthcare industries, evaluating specific diseases and conditions for which real data does not exist.
- Fraud detection and protection are critical in the financial sector. With synthetic data, you can investigate new fraud scenarios.
- Amazon trains Alexa's language system using synthetic data.
- American Express uses synthetic financial data to improve fraud detection.
Types of Synthetic Data
Synthetic data is generated randomly with the aim of hiding sensitive private information while retaining statistical information about the features in the original data.
It mainly comes in three types:
- Fully synthetic data
- Partially synthetic data
- Hybrid synthetic data
1. Fully Synthetic Data
This data is entirely generated and contains no original data.
Typically, the data generator for this type identifies the density functions of the features in the real data and estimates their parameters. Then, from the estimated density functions, a privacy-protected series is drawn at random for each feature.
If only a few features of the real data are selected for replacement, the protected series for those features are mapped onto the remaining features of the real data so that the protected series and the real series are ranked in the same order.
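The two steps just described, estimating a density per feature and then putting the protected series in the same rank order as the real one, might look like the following minimal sketch. It assumes a normal density for the feature purely for illustration; the function names and the sample values are made up.

```python
import random
import statistics


def synthesize_feature(real_values, rng):
    """Fit a normal density to one feature and draw a fresh series from it.
    (Assumes the feature is roughly normal; a real generator would choose a
    density family per feature.)"""
    mu = statistics.mean(real_values)
    sigma = statistics.stdev(real_values)
    return [rng.gauss(mu, sigma) for _ in real_values]


def match_rank_order(synthetic, real_values):
    """Reorder the synthetic series so it has the same rank order as the real
    one: the smallest synthetic value lands where the smallest real value was."""
    order = sorted(range(len(real_values)), key=lambda i: real_values[i])
    result = [0.0] * len(real_values)
    for rank, value in enumerate(sorted(synthetic)):
        result[order[rank]] = value
    return result


rng = random.Random(1)
real_income = [31.0, 45.5, 28.2, 60.1, 52.3, 39.9]  # made-up values
protected = match_rank_order(synthesize_feature(real_income, rng), real_income)
```

Because the protected column keeps the ranking of the real one, its relationship to the untouched columns is roughly preserved even though none of the original values are released.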
Bootstrap techniques and multiple imputation are two traditional methods for producing fully synthetic data.
Because the data is entirely synthetic and contains no real data, this approach offers excellent privacy protection, at some cost to the truthfulness of the data.
2. Partially Synthetic Data
This data uses synthetic values only to replace the values of a few sensitive features.
In this case, real values are replaced only when there is a high risk of disclosure. The change is made to protect privacy in the newly generated data.
Multiple imputation and model-based methods are used to produce partially synthetic data. These methods can also be used to fill in missing values in real-world data.
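As a rough sketch of the idea, the example below replaces a sensitive value only in rows judged to be at risk. The risk rule used here (a row is risky when its combination of age band and city is unique) and the single normal model for salaries are assumptions chosen to keep the example short; the rows themselves are invented.

```python
import random
import statistics
from collections import Counter

records = [  # made-up rows
    {"age_band": "30-39", "city": "Durban", "salary": 410},
    {"age_band": "30-39", "city": "Durban", "salary": 455},
    {"age_band": "40-49", "city": "Durban", "salary": 520},
    {"age_band": "30-39", "city": "Cape Town", "salary": 480},
    {"age_band": "30-39", "city": "Cape Town", "salary": 430},
    {"age_band": "50-59", "city": "Cape Town", "salary": 900},
]


def quasi_id(row):
    """Attributes that could be combined to re-identify a person."""
    return (row["age_band"], row["city"])


def partially_synthesize(rows, rng):
    """Replace the sensitive value only in rows whose quasi-identifier
    combination is unique in the table (the assumed risk rule)."""
    counts = Counter(quasi_id(r) for r in rows)
    salaries = [r["salary"] for r in rows]
    mu, sigma = statistics.mean(salaries), statistics.stdev(salaries)
    released = []
    for r in rows:
        new = dict(r)
        new["synthetic"] = counts[quasi_id(r)] == 1
        if new["synthetic"]:
            new["salary"] = round(rng.gauss(mu, sigma))  # model-based draw
        released.append(new)
    return released


released = partially_synthesize(records, random.Random(7))
```

Rows that share their quasi-identifiers with at least one other row keep their real salary, so most of the table stays truthful while the most exposed records are masked.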
3. Hybrid Synthetic Data
Hybrid synthetic data includes both real and synthetic data.
For each record of real data, the closest record in the synthetic data is selected, and the two are combined to produce hybrid data. It has the advantages of both fully and partially synthetic data.
It therefore offers strong privacy preservation with high utility compared with the other two, but at the cost of more memory and processing time.
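A minimal sketch of the pairing step follows. It assumes numeric records, squared Euclidean distance, and a simple weighted average as the way of combining a real record with its nearest synthetic one; all three are illustrative choices, and the `weight` parameter is hypothetical.

```python
import random
import statistics


def nearest(record, candidates):
    """Return the candidate closest to record (squared Euclidean distance)."""
    return min(candidates, key=lambda c: sum((a - b) ** 2 for a, b in zip(record, c)))


def hybridize(real_rows, synthetic_rows, weight=0.5):
    """Blend each real row with its nearest synthetic row.
    weight is the share taken from the real row (an illustrative knob)."""
    hybrid = []
    for row in real_rows:
        twin = nearest(row, synthetic_rows)
        hybrid.append(tuple(weight * a + (1 - weight) * b for a, b in zip(row, twin)))
    return hybrid


rng = random.Random(3)
real_rows = [(34.0, 410.0), (41.0, 520.0), (29.0, 380.0)]  # made-up (age, salary)
# Stand-in synthetic pool: independent normal draws per column.
columns = list(zip(*real_rows))
synthetic_rows = [
    tuple(rng.gauss(statistics.mean(c), statistics.stdev(c)) for c in columns)
    for _ in range(50)
]
hybrid_rows = hybridize(real_rows, synthetic_rows)
```

The nearest-record search compares every real row with every synthetic row, which is where the extra memory and processing time come from. In practice the columns would also be scaled first so that one large-valued feature does not dominate the distance.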
Techniques of Synthetic Data Generation
The idea of machine-generated data has been around for many years. Now it is maturing.
Here are some of the techniques used to generate synthetic data:
1. Based on distribution
When no real data exists but the data analyst has a clear idea of what the data distribution should look like, they can generate a random sample from any distribution, including Normal, Exponential, Chi-square, t, Lognormal, and Uniform.
The usefulness of synthetic data produced this way depends on how well the analyst understands the data environment in question.
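For illustration, each of the distributions listed above can be sampled with Python's standard `random` module. The parameter values below are arbitrary assumptions; in practice they would come from the analyst's knowledge of the domain.

```python
import random

rng = random.Random(2024)  # fixed seed for reproducibility
n = 1000


def student_t(k, rng):
    """Student's t with k degrees of freedom: Z / sqrt(V / k), V ~ chi-square(k)."""
    z = rng.gauss(0.0, 1.0)
    v = rng.gammavariate(k / 2, 2.0)
    return z / (v / k) ** 0.5


samples = {
    "normal": [rng.gauss(0.0, 1.0) for _ in range(n)],
    "exponential": [rng.expovariate(1.5) for _ in range(n)],   # rate = 1.5
    "lognormal": [rng.lognormvariate(0.0, 0.5) for _ in range(n)],
    "uniform": [rng.uniform(10.0, 20.0) for _ in range(n)],
    # Chi-square with k degrees of freedom is Gamma(shape=k/2, scale=2); here k = 3.
    "chi_square": [rng.gammavariate(3 / 2, 2.0) for _ in range(n)],
    "t": [student_t(5, rng) for _ in range(n)],
}
```

The quality of such data rests entirely on the chosen family and parameters: a normal sample with the wrong mean is still a perfectly valid normal sample, just not a useful one.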
2. Fitting real-world data to a known distribution
If real data is available, businesses can generate synthetic data by identifying the distributions that best fit it.
Businesses can use the Monte Carlo method to generate it if they want to fit real data to a known distribution and know the distribution parameters.
Although the Monte Carlo method can help businesses find the best match available, even the best fit may not be good enough for the company's synthetic data needs.
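A small sketch of this workflow: estimate parameters for a couple of candidate distributions, keep the one with the highest log-likelihood, then draw synthetic values from it by Monte Carlo sampling. The two candidate families, the dictionary layout, and the simulated "real" waiting times are all assumptions made for the example.

```python
import math
import random
import statistics


def fit_normal(data):
    mu = statistics.fmean(data)
    sigma = statistics.pstdev(data)  # maximum-likelihood estimate
    loglik = sum(
        -0.5 * math.log(2 * math.pi * sigma**2) - (x - mu) ** 2 / (2 * sigma**2)
        for x in data
    )
    return {"name": "normal", "params": (mu, sigma), "loglik": loglik}


def fit_exponential(data):
    # Assumes all values are positive, as the exponential requires.
    rate = 1 / statistics.fmean(data)  # maximum-likelihood estimate
    loglik = sum(math.log(rate) - rate * x for x in data)
    return {"name": "exponential", "params": (rate,), "loglik": loglik}


def best_fit(data):
    """Pick the candidate with the highest log-likelihood."""
    return max((fit_normal(data), fit_exponential(data)), key=lambda f: f["loglik"])


def monte_carlo_sample(fit, n, rng):
    """Draw n synthetic values from the fitted distribution."""
    if fit["name"] == "normal":
        return [rng.gauss(*fit["params"]) for _ in range(n)]
    return [rng.expovariate(*fit["params"]) for _ in range(n)]


rng = random.Random(11)
real_waits = [rng.expovariate(0.2) for _ in range(500)]  # stand-in for real data
fit = best_fit(real_waits)
synthetic_waits = monte_carlo_sample(fit, 1000, rng)
```

Comparing raw log-likelihoods is the simplest possible selection rule; with more candidate families, a criterion that penalises extra parameters would be a more careful choice.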
In these cases, businesses can consider using machine learning models to fit the distributions.
Machine learning techniques, such as decision trees, allow organizations to model non-classical distributions, which may be multimodal and lack the usual properties of known distributions.
Businesses can then generate synthetic data that correlates closely with the real data by using this machine-learning-fitted distribution.
However, machine learning models are prone to overfitting, which causes them to fail to fit new data or to predict future observations reliably.
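To show what fitting a non-classical distribution can mean in practice, the sketch below uses an equal-width histogram as a deliberately simple stand-in for a learned model such as a decision tree. It is not a decision tree; it only shares the idea of a piecewise-constant density learned from data. The two-mode "real" sample is simulated and the function names are hypothetical.

```python
import random


def fit_histogram(data, bins):
    """Count observations in equal-width bins between min(data) and max(data)."""
    lo, hi = min(data), max(data)
    width = (hi - lo) / bins
    counts = [0] * bins
    for x in data:
        idx = min(int((x - lo) / width), bins - 1)  # max(data) goes in the last bin
        counts[idx] += 1
    return {"lo": lo, "width": width, "counts": counts}


def sample_histogram(model, n, rng):
    """Pick a bin in proportion to its count, then a uniform point inside it."""
    bin_ids = rng.choices(range(len(model["counts"])), weights=model["counts"], k=n)
    return [model["lo"] + (b + rng.random()) * model["width"] for b in bin_ids]


rng = random.Random(5)
# Stand-in "real" data with two modes, which no single normal would fit well.
real = [rng.gauss(20, 2) for _ in range(600)] + [rng.gauss(60, 3) for _ in range(400)]
model = fit_histogram(real, bins=30)
synthetic = sample_histogram(model, 1000, rng)
```

The number of bins plays the role of model complexity. With very many bins the model simply memorises the sample, which is the overfitting risk mentioned above; with very few it blurs the two modes together.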
3. Deep Learning
Deep generative models such as the Variational Autoencoder (VAE) and the Generative Adversarial Network (GAN) can generate synthetic data.
Variational Autoencoder
A VAE is an unsupervised method in which an encoder compresses the original dataset and passes the compressed data to a decoder.
The decoder then produces an output that is a representation of the original dataset.
Training the system involves maximizing the similarity between the input data and the output.
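In the standard VAE formulation, this training goal is usually written as maximizing the evidence lower bound (ELBO). The notation below is the conventional one and is an addition for clarity, not something stated in this article: $x$ is an input record, $z$ its compressed latent code, $q_\phi$ the encoder, $p_\theta$ the decoder, and $p(z)$ a fixed prior over latent codes.

```latex
\mathcal{L}(\theta, \phi; x) =
  \mathbb{E}_{q_\phi(z \mid x)}\big[\log p_\theta(x \mid z)\big]
  - D_{\mathrm{KL}}\big(q_\phi(z \mid x) \,\|\, p(z)\big)
```

The first term rewards reconstructions that resemble the input. The second keeps the latent codes close to the prior, which is what allows new synthetic records to be generated later by sampling $z$ from $p(z)$ and decoding it.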
Generative Adversarial Network
A GAN is trained iteratively using two networks, the generator and the discriminator.
The generator creates a synthetic dataset from random input samples.
The discriminator compares the synthetically generated data with a real dataset, based on previously set conditions.
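The contest between the two networks is commonly summarised by the minimax objective from the original GAN formulation. The symbols are the conventional ones rather than anything defined in this article: $G$ is the generator, $D$ the discriminator, $p_{\text{data}}$ the distribution of real records, and $p_z$ the distribution of the random inputs.

```latex
\min_G \max_D \;
  \mathbb{E}_{x \sim p_{\text{data}}}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_z}\big[\log\big(1 - D(G(z))\big)\big]
```

The discriminator tries to make the expression large by scoring real records near 1 and generated ones near 0, while the generator tries to make it small by producing records the discriminator cannot tell apart from real ones.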
Synthetic Data Providers
Structured Data
The platforms listed below provide synthetic data derived from tabular data.
This kind of data replicates real-world data stored in tables and can be used for behavioral, predictive, or transactional analysis.
- Instill AI: A provider of a synthetic data creation system that uses Generative Adversarial Networks and differential privacy.
- Betterdata: A provider of a privacy-preserving synthetic data solution for AI, data sharing, and product development.
- Diveplane: The provider of Geminai, a system for creating 'twin' datasets with the same statistical properties as the original data.
Unstructured Data
The platforms listed below work with unstructured data, providing synthetic data products and services for training vision and recognition algorithms.
- Datagen: Provides simulated 3D training data for Visual AI learning and development.
- Neurolabs: A provider of a synthetic data platform for computer vision.
- Parallel Domain: A provider of a synthetic data platform for training and testing autonomous systems.
- Cognata: A simulation provider for ADAS and autonomous vehicle developers.
- Bifrost: Provides synthetic data APIs for creating 3D environments.
Challenges
Synthetic data has a long history in artificial intelligence, and while it has many benefits, it also has important limitations that you need to deal with when working with it.
Here are some of them:
- Many errors can creep in when the complexity of real data is copied into synthetic data.
- Its malleable nature can lead to bias in its behavior.
- Algorithms trained on simplified synthetic representations may have hidden performance flaws that only emerge when they face real data.
- Replicating all the relevant attributes of real-world data can be difficult. Some important aspects may be overlooked along the way.
Conclusion
Synthetic data generation is clearly attracting attention.
The approach may not be a one-size-fits-all answer for every data generation scenario.
In addition, the technique may require AI/ML expertise and must be able to handle complex real-world situations in order to create correlated data, ideally data suited to a specific domain.
Nevertheless, it is an innovative technology that fills a gap where other privacy-enabling technologies fall short.
Today, synthetic data generation may need to coexist with data masking.
In the future, there may be greater convergence between the two, leading to a more comprehensive data generation solution.
Share your thoughts in the comments!