I-Google imemezele i-MusicLM, ubuhlakani bokwenziwa obudala umculo ngamagama owabhalayo, njenge-DALL-E 2. Kuyimodeli yolimi edalwe i-Google Research. Ngaphandle kwalokho, bayiklamele ngokukhethekile ukudala umculo.
Futhi, iqeqeshwe kudathasethi enkulu yamafayela omculo futhi ingakhiqiza umculo ngezitayela namafomu ahlukahlukene. Uma uthanda umculo; khona-ke kufanele uhlole ukuthi i-MusicLM izokunikeza ini.
Nge-MusicLM ukhiqiza umculo ngamasu namafomu athile. Isibonelo, ungakha izingcezu zepiyano, izigqi zezigubhu, nemiculo yamagama.
Futhi, ungakwazi ukushuna kahle izitayela ezithile noma ufake okokufaka okunikezwa umsebenzisi. Kwenzelwe ukukhiqiza umculo ohambisanayo futhi ohambisana nesigqi. Ngakho-ke, ake sicwilise futhi sibone ukuthi i-MusicLM imayelana nani.
Imizamo Yangaphambilini
I-MusicLM akulona uhlelo lokuqala lomculo olukhiqizwa yi-AI. I-Riffusion, Dance Diffusion, i-AudioML ye-Google, kanye ne-OpenAI's I-Jukebox ziyizibonelo zezindlela eziqhathanisekayo. Kodwa-ke, lezi zinhlelo zangaphambilini zavinjelwa imikhawulo yezobuchwepheshe.
Futhi, ukuntula kwabo idatha yokuqeqeshwa kwenza kwaba nzima ukuqamba izingoma zekhwalithi ephezulu. Kodwa-ke, i-MusicLM inamandla okudala umculo ngezinga elikhulu lokuyinkimbinkimbi kanye namaqiniso.
Uhlolojikelele MusicLM
I-MusicLM ifunda ukwakheka nesitayela somculo. Ngakho-ke, iqeqeshwa kudathasethi enkulu ye-MIDI namafayela omculo womfanekiso. Njengezinhlelo zayo ezifanayo, i-MusicLM yakhelwe ekwakhiweni kweTransformer.
Kusetshenziswa amasu okuzinaka ukuze kugxilwe ezingxenyeni ezithile zokufakwayo, i-architecture ye-MusicLM yesiguquli isetshenziselwa ukukhipha isakhiwo nesitayela somculo kudathasethi enkulu. Ngenxa yalokho, ungakha umculo ohambisanayo futhi ohambisana nesigqi.
Futhi, lo mculo ungalingisa inhlangano yokokufaka komsebenzisi. Ngakho-ke, uzokwazi ukuthola umphumela womculo owuchaza ngokuqondile ohlelweni.
Impumelelo yangaphambilini amamodeli olimi, njenge-GPT-2 ne-GPT-3, eziye zafakazela amandla azo okudala ukubhala okuhambisanayo nokushelelayo, i-MusicLM ephefumulelwe. Ngakolunye uhlangothi, iMusicLM iyimodeli yolimi lokuqala eyakhelwe isizukulwane somculo kuphela.
Futhi, sicabanga ukuthi izothathwa njengenye yamamodeli ayinkimbinkimbi kakhulu.
Isebenza kanjani?
I-DALL-E 2 ne-Google MusicLM ukuhlakanipha okungekhona okwangempela ukwabelana okuningi ukufana kwesakhiwo. Nokho, kulokhu, ukubhala kwakho kudluliselwa ngomculo kunokuba kubonakale. Kuleli qophelo, ungakwazi ukwakha ucezu ngokuphelele. Futhi, ungakwazi ukukhiqiza isigqi usebenzisa insimbi eyodwa kuphela.
Ungabuka amasampula ocwaningo ambalwa adalwe ithimba le-Google AI ekhasini le-Github le-MusicLM. Noma i-AI isesigabeni socwaningo nentuthuko, imisindo engayenza iphezulu. Futhi, kube neziphakamiso, njengokuhlanganisa le AI ne-ChatGPT. Lokhu kuhlanganiswa kungaholela emculweni oyinkimbinkimbi nobuciko.
Kusukela kuHumming kuya ku-Hit Melodies
I-MusicLM ihlanganisa amamodeli e-AI amane ahlukene: i-MuLan, i-AudioLM, i-w2v-BERT, ne-Soundstream. Nakuba ngayinye yalezi zinhlobo inesethi yamakhono ahlukile. Kodwa-ke, lapho sezihlanganisiwe, zaphumela kuMusicLM!
Abaculi nezingcweti zemboni baqaphele ikhono le-MusicLM lokuguqula ngisho nokucula nokububula okuyisisekelo kube izingoma eziphelele. Ngokuhlanganisa ne-ChatGPT, ingakhiqiza umculo oyingqayizivele.
Ungalalela futhi uhlole umculo nemisindo edalwe i-MusicLM kuyo iwebhusayithi. Kodwa, khumbula ukuthi okwamanje isesigabeni sokuhlola. Kusobala ukuthi iMusicLM inamandla okuguqula ngokuphelele ibhizinisi lomculo njengoba ubuchwepheshe buthuthuka.
Umculo Okhiqizwe Nge-AI onama-nuances Afana Nomuntu
Ukukhiqiza izingoma ezinengqondo ezisekelwe ezincazelweni eziphelele, i-MusicLM yaqeqeshwa kudathasethi enkulu yamahora angu-280,000 omculo. Isibonelo, ungakha "i-melodic dubstep tune ene-bass ejulile kanye nezigqi eziyinkimbinkimbi zesigubhu". Noma, ungayicela ukuthi idale “ingoma ye-pop ehehayo enesigingci esikhangayo kanye nomdwebi wezwi onamandla.” Umcabango wakho uwumkhawulo kulokhu.
Izingoma ezikhiqiziwe zifana nalezo eziqanjwe abaculi abangabantu. Amasampula e-MusicLM ayamangaza kakhulu. Kuyiqiniso ikakhulukazi uma kubhekwa ukuthi akekho umuntu ohilelekile enqubweni yokuqamba. I-MusicLM ingaphinda izici ezicashile ezifana nokukhala komculo, imiculo, kanye nemizwa. Ngaphandle kwalokho, isebenza noma inikezwe imininingwane eyinkimbinkimbi futhi ecacile.
Izici ezibalulekile
Umdwebo Wamagama-ncazo Conditioning
I-Painting Caption Conditioning iwumsebenzi we-MusicLM. Ungakwazi ukukhiqiza umculo ngokusekelwe encazelweni yombhalo noma "amazwibela" omdwebo. Lokhu kusho ukuthi i-MusicLM iyakwazi ukwenza umculo othwebula imizwa, imizwa, nemibono evezwa esithombeni. Leli khono lisiza kakhulu ekwenzeni umculo wamamuvi, amageyimu evidiyo, kanye nazo zonke izinhlobo zemidiya ebonakalayo.
Imodi Yezindaba
Isici Semodi Yendaba sithatha umbhalo wendaba njengokufakwayo. Ngakho-ke, idala umculo wangemuva ohambisana nawo. Abasebenzisi bangasebenzisa lo msebenzi ukuze bakhe iculo lenganekwane, igeyimu yevidiyo, noma imuvi ngokubonisa isimo noma ithoni yemizwa.
I-Story Mode iyithuluzi eliwusizo labaculi bemidiya. Ngakho, ingakwazi ukukhiqiza izinhlobo eziningi zezitayela zomculo nezinsimbi. I-MusicLM's Tale Mode ikhulisa umuzwa wesigcawu. Ngakho-ke, ababukeli bangaba nezinga elengeziwe lokucwiliswa endabeni.
Izinga Lokuzizwisa Umculi
Ungakwazi ukwenza ngokwezifiso ubunzima bomculo owenziwe. Abasebenzisi bangakhetha phakathi kwamaleveli amathathu ngokusekelwe ezingeni labo lamakhono. Futhi, bangacacisa izinga elincanyelwayo lobunkimbinkimbi: abaqalayo, abaphakathi, noma abathuthukile.
Lesi sici siyakusiza uma unolwazi oluncane lomculo futhi ufuna ukuzama izingoma ezintsha. Nokho, uma ungumculi onesipiliyoni, ungakha umculo oyinkimbinkimbi nocashile. Umgomo we-MusicLM ngalesi sici ukuletha ulwazi olufinyelelekayo kubo bonke abasebenzisi.
Ukuhlukahluka Kwesizukulwane
Ngomsebenzi weGeneration Diversity, ungakhiqiza izinguqulo eziningi zengoma ngokufaka okufanayo. Futhi, ungaba nebanga elihlukahlukene lokuphumayo. Lokhu kusho ukuthi i-AI ingase ikhiqize izinguqulo eziningi zengoma.
Ngaphandle kwalokho, kunezinye izingoma noma ukuqhubekela phambili kwezingoma, kuyilapho kugcinwa isitayela esiyisisekelo nesakhiwo sengoma. Lesi sici sisiza ukudalwa komculo we-AI ukuthi kube nobuciko obuningi. Ngakho-ke, kwenza ukudalwa komculo kufane nokubhala amaculo womuntu.
Imikhawulo engaba khona ye-MusicLM
I-Google ayikayenzi i-MusicLM itholakale emphakathini njengoba isathuthukiswa. Ngakho-ke, awukwazi ukunikeza amasampula athile ezinhlobo zomculo ezingakhiqizwa yi-MusicLM. Ngaphezu kwalokho, namanje akwaziwa ukuthi imiphi imikhawulo i-MusicLM engaba nayo.
Njengoba ubuchwepheshe busesigabeni sokuqala, bungaba nemikhawulo ethile kuzinga lomculo okhiqizwayo noma amandla awo okuphatha okufakiwe okuthile.
Ikhwalithi ehlanekezelwe yamasampuli akhiqiziwe ingenye yezingqinamba ezibalulekile. Lokhu kuwumkhiqizo odingekayo wenqubo yokuqeqesha esetshenziselwa ukuthuthukisa i-MusicLM.
Enye inselelo ukuthi, naphezu kwekhono lobuchwepheshe le-MusicLM lokukhiqiza amazwi. Lokhu kuhlanganisa izingoma zamakhwaya. "Izingoma" ezikhiqizwe yi-MusicLM ngezinye izikhathi zibukeka njenge-gibberish. Ngaphandle kwalokho, kungaba nzima ukuwaqonda. Nokho, i-MusicLM isathuthukiswa futhi lezi zinkinga zingathuthukiswa.
Izimpawu Zokugcina
Okokugcina, sikholelwa ukuthi ubuchwepheshe obungaphansi kwe-Google MusicLM buyathakazelisa futhi buyathandeka. Kuyamangaza ukuthi i-AI ingakwazi ukwenza umculo ngezitayela ezahlukene, ngezinga eliphezulu lamaqiniso. I-MusicLM inamandla okushintsha ibhizinisi lomculo. Futhi, siyajabula ukubuka ukuthi lobu buchwepheshe buvela kanjani.
shiya impendulo