5.3.1 Netflix Prize

The Netflix Prize amfani bude kira zuwa hango ko hasashen abin da fina-finai da mutane za su so.

Mafi kyawun kira na kira shi ne kyautar Netflix. Netflix wani kamfanin kamfani ne na fim din, kuma a shekara ta 2000 ya kaddamar da Cinematch, sabis don bayar da finafinan fina-finai ga abokan ciniki. Alal misali, Cinematch zai iya lura cewa kana son Star Wars da kuma Empire ya Kashe Back sannan kuma ya ba da shawara cewa kayi kalli Return of Jedi . Da farko dai, Cinematch ya yi aiki da talauci. Amma, a tsawon shekarun da suka wuce, ya ci gaba da inganta ikonsa na hango ko wane fina-finai na fina-finai na cin abinci. A shekara ta 2006, duk da haka, ci gaba a kan Cinematch ya fara da kullun. Masu bincike a Netflix sun yi kokari sosai da duk abin da zasu iya tunani, amma, a lokaci guda, suna zargin cewa akwai wasu ra'ayoyin da zasu iya taimaka musu inganta tsarin. Don haka, sun zo tare da abin da ke, a lokacin, wata mahimman bayani: kira mai kira.

Abinda ya dace ga nasarar nasarar Netflix kyauta shine yadda aka kirkiro kira mai kira, kuma wannan zane yana da muhimmiyar darussan yadda za a iya amfani da kira na bude don bincike na zamantakewa. Netflix bai ƙaddamar da neman buƙatar ƙira ba, wanda abin da mutane da yawa ke tunanin lokacin da suka fara la'akari da kira mai budewa. Maimakon haka, Netflix ya kawo matsala mai wuya tare da hanya mai kimantawa: sun kalubalanci mutane suyi amfani da samfurin fina-finan kimanin miliyan 100 don hango komai akan kimanin miliyan 3 (ratings da masu amfani suka yi amma Netflix bai saki ba). Mutumin farko ya kirkiro wani algorithm wanda ya zana kimanin miliyon miliyan 10 wanda aka fi sani da Cinematch zai lashe dala miliyan. Wannan bayyananne da sauƙi don yin amfani da hanyar gwaji - kwatanta bayanan da aka gani tare da bayanan ƙaddamarwa-yana nufin cewa an ƙera Netflix Prize a hanyar da mafita ya fi sauƙin dubawa fiye da samarwa; ya ƙalubalanci kalubalen inganta Cinematch a cikin matsala da ya dace da kira mai kira.

A watan Oktoban 2006, Netflix ta fitar da dataset dauke da nauyin fim din miliyan 100 daga kimanin kimanin abokan ciniki 500,000 (zamuyi la'akari da abubuwan da ke tattare da wannan bayanin a cikin babi na 6). Ana iya fahimtar bayanan Netflix a matsayin babbar matrix wanda shine kimanin abokan ciniki 500,000 da fina-finai 20,000. A cikin wannan matrix, akwai kimanin kimanin miliyan 100 a ma'auni daga taurari zuwa biyar (tebur 5.2). Kalubale shine ya yi amfani da bayanan lura a cikin matrix don hango bayanan farashin da aka gudanar da miliyan 3.

Table 5.2: Tsarin Data daga Kyautar Netflix
Movie 1 Movie 2 Movie 3 ... Movie 20,000
Abokin ciniki 1 2 5 ... ?
Abokin ciniki 2 2 ? ... 3
Abokin ciniki 3 ? 2 ...
\(\vdots\) \(\vdots\) \(\vdots\) \(\vdots\) \(\vdots\)
Abokan ciniki 500,000 ? 2 ... 1

Masu bincike da masu fashin wuta a duniya sun damu da kalubalantar, kuma daga shekara ta 2008 mutane fiye da 30,000 ke aiki a kan (Thompson 2008) . A yayin wannan gwagwarmayar, Netflix ta karbi fiye da dubu 40 da aka samar da shawarar daga kungiyoyin fiye da 5,000 (Netflix 2009) . Babu shakka, Netflix ba zai iya karantawa kuma ya fahimci dukkan waɗannan maganganun da aka kawo ba. Dukkan abin da ya gudana a hankali, duk da haka, saboda mafita sun kasance da sauƙin dubawa. Netflix zai iya samun kwamfutar kwaskwarima tare da bayanan da aka ƙaddara ta amfani da ƙayyadaddun ƙarfin da aka ƙayyade (matakan da suke amfani dasu shine tushen tushen kuskuren kuskure). Yana da wannan damar yin nazarin maganganu da dama wanda ya sa Netflix ya karbi mafita daga kowa da kowa, wanda ya zama mahimmanci saboda ra'ayoyi mai kyau daga wasu wurare masu ban mamaki. A gaskiya ma, ƙungiyar ta samo asali ne ta hanyar samfurin da wasu masu bincike uku suka fara da su ba su da kwarewa wajen kafa tsarin (Bell, Koren, and Volinsky 2010) fim (Bell, Koren, and Volinsky 2010) .

Ɗaya daga cikin kyawawan abubuwan kyautar Netflix ita ce ta sa duk matakan da ake bukata don daidaitawa. Wato, lokacin da mutane suka sauko da ra'ayinsu, ba su buƙatar shigar da takardun shaidar su, shekarunsu, tsere, jinsi, jima'i, ko wani abu game da kansu. Sanarwar da aka kwatanta da sanannen malamin Farfesa daga Stanford an bi da su daidai da wadanda daga matashi a cikin ɗakin kwana. Abin takaici, wannan ba gaskiya ba ne a mafi yawan bincike na zamantakewa. Wato, saboda mafi yawan bincike na zamantakewa, kimantawa yana da lokaci mai yawa kuma yana da ma'ana. Don haka, yawancin binciken binciken ba a yi la'akari sosai ba, kuma idan aka kimanta ra'ayoyin, yana da wuyar kawar da waɗannan kimantawa daga mahaliccin ra'ayoyin. Ayyukan kira na bude, a gefe guda, suna da sauƙin sauƙi don haka zasu iya gano ra'ayoyin da za a rasa idan ba haka ba.

Alal misali, a wata aya a yayin kyautar Netflix, wani mai suna Simon Funk ya wallafa a kan shafinsa wani bayani da aka tsara wanda ya dogara ne akan wani nau'i mai mahimmanci, wanda ya dace da algebra linzamin da ba a taɓa amfani dasu ba daga sauran mahalarta. Funk's blog post ya lokaci guda fasaha da kuma maras kyau informal. Shin wannan shafin yanar gizon ya kwatanta kyakkyawar bayani ko kuma lokacin bata lokaci ne? A waje da aikin kira na budewa, wannan bayani ba zai taɓa samun babban darajar ba. Bayan haka, Simon Funk ba farfesa a MIT ba; Shi masanin software ne wanda, a wannan lokacin, ya kasance mai goyon bayan New Zealand (Piatetsky 2007) . Idan ya aika da wannan ra'ayin ga injiniya a Netflix, ba shakka ba za'a karanta ba.

Abin farin ciki, saboda ka'idodin bincike ya kasance mai sauƙi kuma mai sauƙi a yi amfani da shi, an yi la'akari da ra'ayoyin da aka yi da shi, kuma an bayyana ta a fili cewa tsarin ya kasance mai iko sosai: ya zira kwallo a wuri na hudu a gasar, babban sakamako ya ba da sauran kungiyoyi aiki na watanni a kan matsalar. A ƙarshe, yawancin masu fafatawa (Bell, Koren, and Volinsky 2010) .

Gaskiyar cewa Simon Funk ya zaɓi ya rubuta blog bayan ya bayyana yadda ya dace, maimakon ƙoƙari ya ɓoye shi, ya kuma nuna cewa yawancin masu halartar kyautar Netflix ba su da karfin kyautar dalar Amurka miliyan. Maimakon haka, yawancin masu halartar taron sunyi jin daɗin jin dadin ilimi da kuma al'umma da suka ci gaba da matsala (Thompson 2008) , jin dadin da na tsammanin masu bincike da yawa zasu iya fahimta.

Kyautar Netflix ita ce misali mai kyau na kira mai kira. Netflix ya yi tambaya tare da wasu manufofin (tsinkayar fim din fim) da kuma neman mafita daga mutane da yawa. Netflix ya iya yin nazarin duk waɗannan maganganu saboda sun fi sauƙin dubawa fiye da ƙirƙirar, kuma Netplix ya samu kyakkyawar mafita mafi kyau. Na gaba, Zan nuna maka yadda za a iya amfani da wannan tsarin ta hanyar nazarin halittu da kuma doka, kuma ba tare da kyautar dala miliyan daya ba.