1 00:00:00,000 --> 00:00:05,691 2 00:00:05,691 --> 00:00:07,690 CONNOR HARRIS: Well, I think we'll start with a rather exciting video 3 00:00:07,690 --> 00:00:12,570 produced by a professional consultancy that uses R a lot in its work. 4 00:00:12,570 --> 00:00:16,329 >> NARRATOR: What's behind the statistics, analytics, and visualizations 5 00:00:16,329 --> 00:00:19,770 that today's brightest data scientists and business leaders rely on 6 00:00:19,770 --> 00:00:22,012 to make powerful decisions? 7 00:00:22,012 --> 00:00:23,540 You may not always see it. 8 00:00:23,540 --> 00:00:24,790 But it's there. 9 00:00:24,790 --> 00:00:29,460 It's called R, open source R-- the statistical programming language 10 00:00:29,460 --> 00:00:32,630 that data experts the world over use for everything 11 00:00:32,630 --> 00:00:35,350 from mapping broad social and marketing trends online 12 00:00:35,350 --> 00:00:39,210 to developing financial and climate models that help drive our economy 13 00:00:39,210 --> 00:00:40,780 and communities. 14 00:00:40,780 --> 00:00:44,910 >> But what exactly is R and where did R start? 15 00:00:44,910 --> 00:00:48,620 Well, originally, R started here with two professors 16 00:00:48,620 --> 00:00:51,950 who wanted a better statistical platform for their students. 17 00:00:51,950 --> 00:00:56,030 So they created one modeled after the statistical language S. 18 00:00:56,030 --> 00:01:00,480 They, along with many others, kept working on and using R, 19 00:01:00,480 --> 00:01:05,489 creating tools for R and finding new applications for R every day. 
20 00:01:05,489 --> 00:01:07,750 >> Thanks to this worldwide community effort, 21 00:01:07,750 --> 00:01:11,850 R keeps growing, with thousands of user-created libraries built 22 00:01:11,850 --> 00:01:15,500 to enhance R functionality and crowd-sourced quality validation 23 00:01:15,500 --> 00:01:19,740 and support from the most recognized industry leaders in every field that 24 00:01:19,740 --> 00:01:25,040 uses R. Which is great, because R is the best at what it does. 25 00:01:25,040 --> 00:01:28,540 Budding experts quickly and easily interpret, interact with, 26 00:01:28,540 --> 00:01:33,790 and visualize data, showing their rapidly growing community of R users worldwide 27 00:01:33,790 --> 00:01:36,380 and see how open source R continues to shape 28 00:01:36,380 --> 00:01:39,340 the future of statistical analysis and data science. 29 00:01:39,340 --> 00:01:44,660 30 00:01:44,660 --> 00:01:47,710 >> CONNOR HARRIS: OK, great. 31 00:01:47,710 --> 00:01:50,360 So my own presentation will be a bit more sober. 32 00:01:50,360 --> 00:01:54,380 It will not involve as much exciting background music. 33 00:01:54,380 --> 00:01:59,160 But as you saw in the video, R is sort of a general-purpose programming language. 34 00:01:59,160 --> 00:02:03,720 But it was created mostly for statistical work. 35 00:02:03,720 --> 00:02:07,980 >> So it's designed for statistics, for data analysis, for data mining. 36 00:02:07,980 --> 00:02:12,420 And so you can see this in a lot of the design choices that the makers of R 37 00:02:12,420 --> 00:02:13,320 made. 38 00:02:13,320 --> 00:02:15,472 It's designed largely for people who aren't 39 00:02:15,472 --> 00:02:17,930 experts in programming, who are just picking up programming 40 00:02:17,930 --> 00:02:23,460 on the side so they can do their work in social science or in statistics 41 00:02:23,460 --> 00:02:25,440 or whatever. 
42 00:02:25,440 --> 00:02:27,850 >> It has a lot of very important differences from C. 43 00:02:27,850 --> 00:02:33,200 But the syntax and the paradigms that it uses are broadly the same. 44 00:02:33,200 --> 00:02:36,830 And you should feel pretty much at home right off the bat. 45 00:02:36,830 --> 00:02:38,520 It's an imperative language. 46 00:02:38,520 --> 00:02:40,260 >> Don't worry too much about that if you don't know the term. 47 00:02:40,260 --> 00:02:42,676 But there's a distinction between imperative, declarative, 48 00:02:42,676 --> 00:02:43,810 and functional. 49 00:02:43,810 --> 00:02:47,600 Imperative just means you make statements that are basically commands. 50 00:02:47,600 --> 00:02:52,340 And then the interpreter or the computer follows them one by one. 51 00:02:52,340 --> 00:02:56,630 It's weakly typed, there are no type declarations in R. 52 00:02:56,630 --> 00:02:59,130 >> And then the lines between different types 53 00:02:59,130 --> 00:03:03,920 are a bit looser than they are in C, for example. 54 00:03:03,920 --> 00:03:06,450 And as I said, there are very extensive facilities 55 00:03:06,450 --> 00:03:15,610 for graphing, for statistical analysis, for data mining. 56 00:03:15,610 --> 00:03:19,540 These are both built into the language and, as the video said, 57 00:03:19,540 --> 00:03:23,680 thousands of third-party libraries that you can download and use free of charge 58 00:03:23,680 --> 00:03:25,340 with very liberal license terms. 59 00:03:25,340 --> 00:03:28,800 60 00:03:28,800 --> 00:03:31,500 >> So in general, I'd recommend that you look at these two books 61 00:03:31,500 --> 00:03:34,610 if you're going to work in R. One of them is the official R beginner's 62 00:03:34,610 --> 00:03:35,110 guide. 63 00:03:35,110 --> 00:03:38,660 It's maintained by the core developers of R. 64 00:03:38,660 --> 00:03:42,400 You can download it, again, free of charge and legally at that link there. 
65 00:03:42,400 --> 00:03:45,430 66 00:03:45,430 --> 00:03:49,869 All of these slides are going up on the internet, on the CS50 website, 67 00:03:49,869 --> 00:03:50,660 after this is over. 68 00:03:50,660 --> 00:03:53,690 So there's no need to copy things down frantically. 69 00:03:53,690 --> 00:03:56,800 >> The other one is a textbook by Cosma Shalizi, 70 00:03:56,800 --> 00:04:00,100 who's a statistics professor at Carnegie Mellon, called Advanced Data 71 00:04:00,100 --> 00:04:02,160 Analysis from an Elementary Point of View. 72 00:04:02,160 --> 00:04:04,010 This isn't primarily an R book. 73 00:04:04,010 --> 00:04:07,130 It's a statistics book and it's a data analysis book. 74 00:04:07,130 --> 00:04:11,990 But it's very accessible to people who have a small amount of statistical knowledge. 75 00:04:11,990 --> 00:04:13,750 >> I've never taken a formal course. 76 00:04:13,750 --> 00:04:17,269 I just know bits and pieces from various allied subjects 77 00:04:17,269 --> 00:04:18,579 that I've taken courses in. 78 00:04:18,579 --> 00:04:21,839 And I was able to understand it perfectly well. 79 00:04:21,839 --> 00:04:25,630 >> All the figures are given in R. They're made in R 80 00:04:25,630 --> 00:04:30,280 and they also have code snippets beneath each figure that tell you 81 00:04:30,280 --> 00:04:33,270 how to make each figure with R code. 82 00:04:33,270 --> 00:04:37,400 And that's very useful if you're trying to replicate 83 00:04:37,400 --> 00:04:38,650 some figure you see in the book. 84 00:04:38,650 --> 00:04:47,840 >> And again, free download at stat.cmu.edu/cshalizi/ Sorry, 85 00:04:47,840 --> 00:04:50,230 that should be slash tilde cshalizi. 86 00:04:50,230 --> 00:04:53,150 I'll make sure to correct that when the official slides go up. 87 00:04:53,150 --> 00:04:57,000 /ADAfaEPoV which is just the abbreviation of the book name. 
88 00:04:57,000 --> 00:04:59,850 89 00:04:59,850 --> 00:05:02,500 >> So general caveats-- R has a lot of capabilities. 90 00:05:02,500 --> 00:05:05,331 I'm only going to be able to scratch the surface of a lot of things. 91 00:05:05,331 --> 00:05:08,580 Also the first part of the seminar is going to be something of a data dump. 92 00:05:08,580 --> 00:05:11,437 I'm quite sorry about that. 93 00:05:11,437 --> 00:05:13,770 Basically, I'm going to introduce you to a lot of things 94 00:05:13,770 --> 00:05:15,350 right off the bat, going as fast as possible. 95 00:05:15,350 --> 00:05:17,058 And then we'll get to the fun part, which is 96 00:05:17,058 --> 00:05:20,570 the demo where I can show everything that we've been talking about on the screen. 97 00:05:20,570 --> 00:05:23,321 And you can play around on your own. 98 00:05:23,321 --> 00:05:26,070 So there's going to be a lot of technical stuff thrown up here. 99 00:05:26,070 --> 00:05:28,060 Don't worry about copying everything down. 100 00:05:28,060 --> 00:05:31,740 Because, A, you can get all the stuff on the CS50 website later. 101 00:05:31,740 --> 00:05:37,780 And B, it's not really that important to memorize this from the slides. 102 00:05:37,780 --> 00:05:40,462 It's more important that you get some intuitive facility with it 103 00:05:40,462 --> 00:05:44,220 and that comes from just playing around. 104 00:05:44,220 --> 00:05:45,720 >> So why use R? 105 00:05:45,720 --> 00:05:49,440 Basically, if you have a project that involves mining large data sets, data 106 00:05:49,440 --> 00:05:52,664 visualization, you should use R. If you're 107 00:05:52,664 --> 00:05:55,830 doing complicated statistical analysis that would be difficult in Excel, 108 00:05:55,830 --> 00:05:58,010 for instance, it would also be good, also 109 00:05:58,010 --> 00:06:00,506 if you're doing statistical analysis that's automated. 110 00:06:00,506 --> 00:06:02,130 Let's say you're maintaining a website. 
111 00:06:02,130 --> 00:06:06,320 And you want to read the server log every day and compile some list, 112 00:06:06,320 --> 00:06:10,320 like the top countries that your users are coming from, 113 00:06:10,320 --> 00:06:15,100 some summary statistics on how long they spend on your website or whatever. 114 00:06:15,100 --> 00:06:16,910 And you want to run this every day. 115 00:06:16,910 --> 00:06:20,280 >> Now if you're doing this in Excel, you'd have to go to your server log, 116 00:06:20,280 --> 00:06:23,490 import that into an Excel spreadsheet, 117 00:06:23,490 --> 00:06:24,910 run all the analysis manually. 118 00:06:24,910 --> 00:06:27,100 With R, you can just write one script. 119 00:06:27,100 --> 00:06:29,520 Schedule it to run every day from your operating system. 120 00:06:29,520 --> 00:06:33,657 And then every night at 2:00 AM, or whenever you schedule it to run, 121 00:06:33,657 --> 00:06:35,990 it will look through your internet traffic for that day. 122 00:06:35,990 --> 00:06:39,010 And then by the next day, you'll have this shiny new report 123 00:06:39,010 --> 00:06:41,710 or whatever with all of the information you asked for. 124 00:06:41,710 --> 00:06:44,960 125 00:06:44,960 --> 00:06:50,217 >> So basically R is for Cisco programming versus Cisco analysis. 126 00:06:50,217 --> 00:06:51,050 Preliminaries are over. 127 00:06:51,050 --> 00:06:53,104 Let's get into the actual stuff. 128 00:06:53,104 --> 00:06:55,020 So there are three actual types in the language. 129 00:06:55,020 --> 00:06:56,120 There's the numeric type. 130 00:06:56,120 --> 00:07:01,250 There's a sort of distinction between integers and floating points, 131 00:07:01,250 --> 00:07:02,769 but not really. 132 00:07:02,769 --> 00:07:04,560 There's the character type, which is strings. 133 00:07:04,560 --> 00:07:07,100 And there's the logical type, which is Booleans. 
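As a minimal sketch of the three basic types just described (the variable names `n`, `s`, and `b` are illustrative and not from the slides), `class()` reports the type of a value:

```r
# Illustrative values; the names n, s, and b are not from the talk.
n <- 3.5        # numeric
s <- "hello"    # character (a string)
b <- TRUE       # logical (a Boolean)

print(class(n))
print(class(s))
print(class(b))

# Whole numbers are stored as floating point unless marked with an L suffix,
# which is the loose integer/floating-point distinction mentioned above.
print(class(2))
print(class(2L))
```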
134 00:07:07,100 --> 00:07:11,080 >> And you can convert between types using these functions as.numeric, 135 00:07:11,080 --> 00:07:15,220 as.character, as.logical. 136 00:07:15,220 --> 00:07:17,510 If you call, for example, as.numeric on a string, 137 00:07:17,510 --> 00:07:20,030 it will try to read that string as a number, the same way 138 00:07:20,030 --> 00:07:25,897 that atoi and scanf do, in C. If you call as.numeric on true or false 139 00:07:25,897 --> 00:07:26,980 it will convert it to 1 or 0. 140 00:07:26,980 --> 00:07:29,110 If you call as.character on anything it'll 141 00:07:29,110 --> 00:07:32,550 convert that into a string representation. 142 00:07:32,550 --> 00:07:34,990 >> Then there are vectors and matrices. 143 00:07:34,990 --> 00:07:37,580 So vectors are basically 1-dimensional arrays. 144 00:07:37,580 --> 00:07:40,600 They're what we call arrays in C. Matrices, 2-dimensional arrays. 145 00:07:40,600 --> 00:07:42,350 And then higher-dimensional arrays, you can have 146 00:07:42,350 --> 00:07:48,560 3, 4, 5 dimensions or whatever of numeric values, of strings, 147 00:07:48,560 --> 00:07:52,860 of logical values. 148 00:07:52,860 --> 00:07:55,380 >> You can also have lists, which are a sort of associative array. 149 00:07:55,380 --> 00:07:57,390 I'll get into that in a bit. 150 00:07:57,390 --> 00:07:59,390 So one important thing that trips people up in R 151 00:07:59,390 --> 00:08:01,470 is that there are no real, pure atomic types. 152 00:08:01,470 --> 00:08:05,870 There's no real distinction between a number, like a numeric value, 153 00:08:05,870 --> 00:08:07,920 and a list of numeric values. 154 00:08:07,920 --> 00:08:12,370 Numeric values are really the same as vectors of length 1. 155 00:08:12,370 --> 00:08:14,959 And this has a number of important implications. 
156 00:08:14,959 --> 00:08:17,500 One, it means that you can very easily do things that involve, 157 00:08:17,500 --> 00:08:21,037 like, adding a number to a vector. 158 00:08:21,037 --> 00:08:23,120 R will basically figure out what that means. 159 00:08:23,120 --> 00:08:24,610 And I'll get into that in a second. 160 00:08:24,610 --> 00:08:27,930 It also means that there's no way for the type checker-- to the extent 161 00:08:27,930 --> 00:08:30,530 that such a thing exists in R-- to tell 162 00:08:30,530 --> 00:08:33,780 when you've passed in a single value when it expects an array or vice versa. 163 00:08:33,780 --> 00:08:39,159 And that can cause some odd problems that I ran into when 164 00:08:39,159 --> 00:08:42,252 I was using R during my summer job. 165 00:08:42,252 --> 00:08:43,710 And there are no mixed-type arrays. 166 00:08:43,710 --> 00:08:46,543 So you can't have an array where the first element is, I don't know, 167 00:08:46,543 --> 00:08:49,332 the string "John" and the second element is the number 42. 168 00:08:49,332 --> 00:08:52,540 If you try to do that, then you'll just get everything converted to a string. 169 00:08:52,540 --> 00:08:54,760 So you'd have the string John, the string 42. 170 00:08:54,760 --> 00:08:58,250 171 00:08:58,250 --> 00:09:02,025 >> So syntactic features-- most R syntax is the same as C. 172 00:09:02,025 --> 00:09:04,690 There are a few important differences. 173 00:09:04,690 --> 00:09:05,620 Typing is very weak. 174 00:09:05,620 --> 00:09:07,360 So there are no variable declarations. 175 00:09:07,360 --> 00:09:12,670 Assignment uses this odd less-than hyphen operator. 176 00:09:12,670 --> 00:09:15,340 Comments are with the hash mark. 177 00:09:15,340 --> 00:09:19,230 I think nowadays we call it a hashtag, though that's not really accurate-- not 178 00:09:19,230 --> 00:09:21,810 a double slash. 
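The conversions and coercion rules described so far can be sketched roughly as follows. The variable names are made up for illustration; only the base R functions `as.numeric`, `as.character`, `as.logical`, and `c` are assumed.

```r
# Converting between the basic types.
num_from_string <- as.numeric("42")    # the number 42
num_from_bool   <- as.numeric(TRUE)    # 1
string_from_num <- as.character(3.5)   # the string "3.5"
bool_from_str   <- as.logical("TRUE")  # TRUE

# A single number is just a vector of length 1.
x <- 7
print(length(x))

# Adding a number to a vector adds it to every element.
v <- c(1, 2, 3) + 10
print(v)

# Mixing types in one vector converts everything to strings.
mixed <- c("John", 42)
print(class(mixed))
print(mixed)
```

Note the `<-` assignment and `#` comments, which are the two syntax differences from C mentioned above.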
179 00:09:21,810 --> 00:09:24,710 >> Modular remainder is with the %% sign. 180 00:09:24,710 --> 00:09:30,172 Integer division is with %/% which is very hard to read when it's projected 181 00:09:30,172 --> 00:09:30,880 up on the screen. 182 00:09:30,880 --> 00:09:34,150 183 00:09:34,150 --> 00:09:37,200 You can get ranges of integers with the colon. 184 00:09:37,200 --> 00:09:41,840 So 2:5 will give you a vector of all the numbers 2 through 5. 185 00:09:41,840 --> 00:09:44,530 >> Arrays are one-indexed, which screws a lot of people 186 00:09:44,530 --> 00:09:47,540 up if you're coming from more common programming languages, 187 00:09:47,540 --> 00:09:50,450 like C, where most things are zero-indexed. 188 00:09:50,450 --> 00:09:54,420 Again, this is where R's heritage as a language for, sort of, non- 189 00:09:54,420 --> 00:09:56,560 professional programmers comes in. 190 00:09:56,560 --> 00:09:59,680 If you're a sociologist or an economist or something 191 00:09:59,680 --> 00:10:01,980 and you're trying to use R basically as an adjunct 192 00:10:01,980 --> 00:10:03,832 to your more important professional work, 193 00:10:03,832 --> 00:10:06,040 you're going to find one-indexing a bit more natural. 194 00:10:06,040 --> 00:10:09,890 Because you start counting at 1 in everyday life, not 0. 195 00:10:09,890 --> 00:10:13,260 >> For loops, this is the same as the foreach construct in PHP, 196 00:10:13,260 --> 00:10:17,090 which you'll get to learn in-- pretty soon. 197 00:10:17,090 --> 00:10:22,540 Which is for value in vector, and then you can do things with value. 198 00:10:22,540 --> 00:10:24,040 AUDIENCE: That's coming up in lecture. 199 00:10:24,040 --> 00:10:26,248 CONNOR HARRIS: Oh, that's coming up in lecture, excellent. 200 00:10:26,248 --> 00:10:29,815 AUDIENCE: The assignment, is it supposed to point from right to left? 201 00:10:29,815 --> 00:10:31,440 CONNOR HARRIS: From right to left, yes. 
202 00:10:31,440 --> 00:10:34,720 You can think of it as the value on the right being shoved into the variable 203 00:10:34,720 --> 00:10:36,240 on the left. 204 00:10:36,240 --> 00:10:36,781 AUDIENCE: OK. 205 00:10:36,781 --> 00:10:39,770 206 00:10:39,770 --> 00:10:42,330 >> CONNOR HARRIS: And finally, function syntax is a bit odd. 207 00:10:42,330 --> 00:10:48,460 You have the function name foo, assigned to this function keyword, followed 208 00:10:48,460 --> 00:10:51,530 by all the arguments and then the body of the function after that. 209 00:10:51,530 --> 00:10:53,280 Again, these things may seem a bit odd. 210 00:10:53,280 --> 00:10:57,181 They'll become second nature after working with the language for a bit. 211 00:10:57,181 --> 00:10:58,930 So vectors, the way you construct a vector 212 00:10:58,930 --> 00:11:04,550 is you type c, which is a keyword, then all the numbers you want or strings 213 00:11:04,550 --> 00:11:06,490 or whatever. 214 00:11:06,490 --> 00:11:07,995 Arguments can also be vectors. 215 00:11:07,995 --> 00:11:09,620 But the resulting array gets flattened. 216 00:11:09,620 --> 00:11:14,385 So you can't have arrays where some elements are single numbers 217 00:11:14,385 --> 00:11:17,010 and some elements are arrays themselves. 218 00:11:17,010 --> 00:11:20,010 >> So if you try to construct an array where the first element is 4 219 00:11:20,010 --> 00:11:22,370 and the second element is the array 3,5 you'll 220 00:11:22,370 --> 00:11:25,890 just get a three-element array, 4,3,5. 221 00:11:25,890 --> 00:11:27,760 They can't have mixed types. 222 00:11:27,760 --> 00:11:32,290 If you try to read or write outside the bounds of a vector 223 00:11:32,290 --> 00:11:36,640 you'll get this value called NA, which stands for a missing value. 224 00:11:36,640 --> 00:11:39,900 And this is intended for, like, statisticians who 225 00:11:39,900 --> 00:11:43,080 are working with incomplete data sets. 
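A short sketch of the syntax covered so far: the two unusual operators, colon ranges, one-based indexing, for loops, function definitions, and `c()`. The function `square` and all variable names are hypothetical examples, not taken from the slides.

```r
# Operators that differ from C.
r <- 17 %% 5     # modular remainder
q <- 17 %/% 5    # integer division

# The colon builds a range of integers; indexing starts at 1.
v <- 2:5
first <- v[1]

# A for loop walks over the values of a vector.
total <- 0
for (value in v) {
  total <- total + value
}

# A function is assigned to a name with the function keyword.
# `square` is a made-up example.
square <- function(x) {
  x * x
}

# c() flattens its arguments into one vector.
flat <- c(4, c(3, 5))
print(flat)
print(length(flat))

# Reading past the end of a vector gives NA rather than an error.
past_end <- flat[10]
print(past_end)
```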
226 00:11:43,080 --> 00:11:46,460 >> If you apply a function that's supposed to take just one number to an array 227 00:11:46,460 --> 00:11:49,220 then what you'll get is, the function will map over the array. 228 00:11:49,220 --> 00:11:52,130 So let's say your function takes a number and returns its square. 229 00:11:52,130 --> 00:11:58,170 Apply that to the array 2,3,5. What you'll get is the array 4,9,25. 230 00:11:58,170 --> 00:12:00,010 >> And that's very useful because it means you 231 00:12:00,010 --> 00:12:03,374 don't have to write for loops to do very simple things like applying 232 00:12:03,374 --> 00:12:05,040 a function to all members of a data set. 233 00:12:05,040 --> 00:12:08,557 Which, if you're working with large data sets, you have to do a lot. 234 00:12:08,557 --> 00:12:10,390 Binary functions are applied entry by entry. 235 00:12:10,390 --> 00:12:12,430 I'll get into that. 236 00:12:12,430 --> 00:12:16,750 You can access elements of arrays or vectors with square brackets. 237 00:12:16,750 --> 00:12:22,300 So vector name square bracket 1 will give you the first element. 238 00:12:22,300 --> 00:12:25,510 Vector name square bracket 2 will give you the second element. 239 00:12:25,510 --> 00:12:27,530 >> You can pass in a vector of indices and you'll 240 00:12:27,530 --> 00:12:29,640 get back out basically a subvector. 241 00:12:29,640 --> 00:12:34,990 So you can do vector name bracket c, 2,4 and you'll get out a vector containing 242 00:12:34,990 --> 00:12:38,804 the second and fourth elements of the array. 243 00:12:38,804 --> 00:12:40,720 And if you just want quick summary statistics 244 00:12:40,720 --> 00:12:47,529 of a vector like the interquartile range, median, maximum, whatever, 245 00:12:47,529 --> 00:12:49,820 you can just type summary vector name and get that out. 246 00:12:49,820 --> 00:12:52,680 That's not really useful in programming, but if you're playing 247 00:12:52,680 --> 00:12:55,990 around with data sets, it's handy. 
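The mapping, indexing, and summary behavior just described might look like this in practice. This is only a sketch; `square`, `v`, and `w` are illustrative names.

```r
# A function written for a single number (hypothetical example).
square <- function(x) x * x

# Applied to a vector, it maps over every element with no explicit loop.
v <- c(2, 3, 5)
squares <- square(v)
print(squares)

# Binary operators work entry by entry.
sums <- c(1, 2, 3) + c(10, 20, 30)
print(sums)

# Square brackets access elements, counting from 1.
w <- c(10, 20, 30, 40, 50)
second <- w[2]

# A vector of indices returns a subvector.
second_and_fourth <- w[c(2, 4)]
print(second_and_fourth)

# Quick summary statistics: minimum, quartiles, median, mean, maximum.
stats <- summary(w)
print(stats)
```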
248 00:12:55,990 --> 00:12:58,650 >> Matrices-- basically higher-dimensional arrays. 249 00:12:58,650 --> 00:13:01,190 They have this special syntax notation. 250 00:13:01,190 --> 00:13:07,620 Matrix with an array that gets filled in-- sorry, matrix with data, 251 00:13:07,620 --> 00:13:09,780 number of rows, number of columns. 252 00:13:09,780 --> 00:13:13,180 When you have some data, it fills in the array basically going top to bottom 253 00:13:13,180 --> 00:13:13,380 first. 254 00:13:13,380 --> 00:13:14,190 Then left to right. 255 00:13:14,190 --> 00:13:15,030 So, like that. 256 00:13:15,030 --> 00:13:17,809 257 00:13:17,809 --> 00:13:19,600 And R has built-in matrix multiplication, 258 00:13:19,600 --> 00:13:24,310 spectral decomposition, diagonalization, a lot of things. 259 00:13:24,310 --> 00:13:27,785 If you want higher-dimensional arrays, so 3, 4, 5, 260 00:13:27,785 --> 00:13:29,410 or whatever dimensions, you can do that. 261 00:13:29,410 --> 00:13:34,400 The syntax is array, dim equals c, then the list of dimensions. 262 00:13:34,400 --> 00:13:38,620 So if you want a 4-dimensional array with dimensions 4, 7, 8, 9, array, 263 00:13:38,620 --> 00:13:45,470 dim equals c(4,7,8,9). 264 00:13:45,470 --> 00:13:51,180 >> Access single values with bracket, first entry, comma, second entry. 265 00:13:51,180 --> 00:13:54,870 You can get whole slices of rows or columns. 266 00:13:54,870 --> 00:13:59,900 And the syntax for this is just partial: row number comma, or comma column 267 00:13:59,900 --> 00:14:00,400 number. 268 00:14:00,400 --> 00:14:02,874 269 00:14:02,874 --> 00:14:04,540 So lists are a sort of associative array. 270 00:14:04,540 --> 00:14:06,360 They have their own syntax here. 271 00:14:06,360 --> 00:14:08,320 Again, don't frantically copy down all of this. 272 00:14:08,320 --> 00:14:11,370 This is so people can go to the slides later 273 00:14:11,370 --> 00:14:13,089 and have this all as a nice reference. 
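For reference, the matrix and array syntax described above can be sketched like this. The data values are made up; `matrix`, `array`, `dim`, `t`, and `%*%` are base R.

```r
# Build a 2 x 3 matrix; the data fills top to bottom, then left to right:
#      [,1] [,2] [,3]
# [1,]    1    3    5
# [2,]    2    4    6
m <- matrix(1:6, nrow = 2, ncol = 3)
print(m)

# Single value: row 1, column 2.
single <- m[1, 2]

# Whole slices: leave one side of the comma empty.
second_row   <- m[2, ]
third_column <- m[, 3]

# Built-in matrix multiplication uses %*%; t() is the transpose.
product <- m %*% t(m)
print(product)

# A 4-dimensional array with dimensions 4, 7, 8, 9, filled with zeros.
a <- array(0, dim = c(4, 7, 8, 9))
print(dim(a))
```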
274 00:14:13,089 --> 00:14:16,130 And this will be very natural once I actually walk through the demos. 275 00:14:16,130 --> 00:14:19,295 276 00:14:19,295 --> 00:14:20,920 So lists are basically associative arrays. 277 00:14:20,920 --> 00:14:27,040 You can access values with list name, dollar sign, key. 278 00:14:27,040 --> 00:14:31,370 So if your list is named foo, then you can access it like that. 279 00:14:31,370 --> 00:14:37,032 You can get whole key-value pairs by passing in a square bracket index. 280 00:14:37,032 --> 00:14:39,240 If you read from a nonexistent key, you'll get null. 281 00:14:39,240 --> 00:14:41,150 It will not be an error. 282 00:14:41,150 --> 00:14:43,590 The issue here is, R will try to do as much with null as it can. 283 00:14:43,590 --> 00:14:46,580 And this means that if you're not expecting to get null out of 284 00:14:46,580 --> 00:14:51,840 some list read, you'll get some unpredictable errors further down 285 00:14:51,840 --> 00:14:52,620 the line. 286 00:14:52,620 --> 00:14:54,890 >> This happened to me at my summer job when I was using R, 287 00:14:54,890 --> 00:14:58,410 where I changed how some list was defined in one spot 288 00:14:58,410 --> 00:15:05,410 but didn't change the later code that read values from it. 289 00:15:05,410 --> 00:15:10,190 And so what happened was I was reading null values out of this list, 290 00:15:10,190 --> 00:15:13,090 passing them into functions, and being very confused 291 00:15:13,090 --> 00:15:16,000 when I got all sorts of random infinities cropping up 292 00:15:16,000 --> 00:15:16,790 in this function. 293 00:15:16,790 --> 00:15:20,730 Because if you apply certain maximum or minimum functions to null, 294 00:15:20,730 --> 00:15:22,570 you'll get infinite values out. 295 00:15:22,570 --> 00:15:26,400 296 00:15:26,400 --> 00:15:29,180 >> Data frames, they're a kind of list. 
297 00:15:29,180 --> 00:15:31,170 Each value in one is a vector of the same length. 298 00:15:31,170 --> 00:15:34,220 And they're used for presenting, basically, data tables. 299 00:15:34,220 --> 00:15:36,175 There's this initialization syntax. 300 00:15:36,175 --> 00:15:38,800 This will all, again, be rather clearer when we get to the demo. 301 00:15:38,800 --> 00:15:42,240 302 00:15:42,240 --> 00:15:44,240 The nice thing about data frames is that you 303 00:15:44,240 --> 00:15:49,380 can give names to all the columns and names to all the rows. 304 00:15:49,380 --> 00:15:53,890 And so that makes accessing them a bit friendlier. 305 00:15:53,890 --> 00:15:59,130 Also this is how most functions that read in data from Excel 306 00:15:59,130 --> 00:16:03,820 or from text files, for example, read in their data. 307 00:16:03,820 --> 00:16:07,555 They'll put it in some sort of data frame. 308 00:16:07,555 --> 00:16:09,680 So functions-- function syntax is a bit weird. 309 00:16:09,680 --> 00:16:16,160 Again it's the function name, assign, this function keyword and then 310 00:16:16,160 --> 00:16:17,900 the list of arguments. 311 00:16:17,900 --> 00:16:24,080 So there are some nice things about how functions work here. 312 00:16:24,080 --> 00:16:28,170 For one, you can actually assign default values to certain arguments. 313 00:16:28,170 --> 00:16:32,910 So you can say arg1 equals-- you can say foo 314 00:16:32,910 --> 00:16:38,290 is a function where arg1 equals something by default if the user specifies 315 00:16:38,290 --> 00:16:39,090 no argument. 316 00:16:39,090 --> 00:16:41,932 Otherwise, it's whatever they put in. 317 00:16:41,932 --> 00:16:44,140 And this is very handy because a lot of functions 318 00:16:44,140 --> 00:16:47,910 often have dozens or hundreds of arguments. 
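A rough sketch of lists, data frames, and default arguments as described so far. Every name and value here (`person`, `visits_table`, `scale_by`, the countries and counts) is made up for illustration and does not come from the slides.

```r
# A list used as an associative array.
person <- list(name = "John", age = 42)
age <- person$age            # access a value by key
pair <- person["age"]        # a one-element list holding the key-value pair
nothing <- person$height     # nonexistent key: NULL, not an error

# NULL passes quietly into some functions; max() of NULL is -Inf
# (R does emit a warning, suppressed here).
surprise <- suppressWarnings(max(nothing))
print(surprise)

# A data frame: a table whose columns are vectors of equal length,
# with names for the columns and for the rows.
visits_table <- data.frame(
  visits  = c(120, 45, 30),
  minutes = c(3.2, 5.1, 2.4),
  row.names = c("US", "CA", "MX")
)
print(visits_table)
ca_visits <- visits_table["CA", "visits"]
all_minutes <- visits_table$minutes

# A function with a default argument value.
scale_by <- function(x, factor = 2) {
  x * factor
}
doubled <- scale_by(5)       # uses the default factor
tripled <- scale_by(5, 3)    # overrides it
```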
319 00:16:47,910 --> 00:16:51,210 For example, the ones for plotting graphs or plotting scatter plots 320 00:16:51,210 --> 00:16:54,430 have arguments that control everything from the title and axis 321 00:16:54,430 --> 00:16:59,512 labels to the color of regression lines. 322 00:16:59,512 --> 00:17:01,470 And so if you don't want to make people specify 323 00:17:01,470 --> 00:17:04,050 every one of these hundreds of arguments 324 00:17:04,050 --> 00:17:07,674 controlling every single aspect of a plot or a regression or whatever, 325 00:17:07,674 --> 00:17:09,299 it's nice to have these default values. 326 00:17:09,299 --> 00:17:12,700 327 00:17:12,700 --> 00:17:19,146 >> And then you can actually write, as you saw back here. 328 00:17:19,146 --> 00:17:22,869 Or find a better example. 329 00:17:22,869 --> 00:17:28,690 When you call functions you can actually call them using argument names. 330 00:17:28,690 --> 00:17:33,919 So here's an example of the matrix constructor. 331 00:17:33,919 --> 00:17:34,960 It takes three arguments. 332 00:17:34,960 --> 00:17:36,760 Usually you have data, which is a vector. 333 00:17:36,760 --> 00:17:38,920 You have nrow, which is the number of rows. 334 00:17:38,920 --> 00:17:41,160 You have ncol-- the number of columns. 335 00:17:41,160 --> 00:17:43,920 The point here is, if you type nrow equals whatever 336 00:17:43,920 --> 00:17:46,520 and ncol equals whatever when you're calling this function, 337 00:17:46,520 --> 00:17:47,770 you can actually swap them. 338 00:17:47,770 --> 00:17:51,590 So you can put ncol first and nrow second and it will make no difference. 339 00:17:51,590 --> 00:17:54,660 So that's a nice little feature. 340 00:17:54,660 --> 00:17:56,260 >> There's import and export. 341 00:17:56,260 --> 00:18:00,010 This can be done, basically. 342 00:18:00,010 --> 00:18:03,816 There are also facilities to write arbitrary R objects to binary files 343 00:18:03,816 --> 00:18:05,190 and then read them back in later. 
344 00:18:05,190 --> 00:18:08,030 Which is useful if you're doing a big interactive R session 345 00:18:08,030 --> 00:18:12,850 and you need to save things very quickly. 346 00:18:12,850 --> 00:18:16,460 By default R has a working directory that files get written out to 347 00:18:16,460 --> 00:18:19,410 and read back in from. 348 00:18:19,410 --> 00:18:22,350 You can see that with getwd, change it with setwd. 349 00:18:22,350 --> 00:18:25,630 Nothing particularly interesting here. 350 00:18:25,630 --> 00:18:28,270 >> So now the actual statistics stuff-- multilinear regression. 351 00:18:28,270 --> 00:18:30,960 352 00:18:30,960 --> 00:18:34,910 So the usual syntax is a bit complicated. 353 00:18:34,910 --> 00:18:37,260 The model is basically a big object. 354 00:18:37,260 --> 00:18:39,910 It gets assigned from lm, which is a function call. 355 00:18:39,910 --> 00:18:43,840 The first item, y tilde x1 plus whatever. 356 00:18:43,840 --> 00:18:46,574 357 00:18:46,574 --> 00:18:47,990 My syntax here is a bit confusing. 358 00:18:47,990 --> 00:18:49,490 I'm quite sorry, this is the standard way 359 00:18:49,490 --> 00:18:50,990 that computer science textbooks do it. 360 00:18:50,990 --> 00:18:54,890 But it's a bit weird. 361 00:18:54,890 --> 00:18:58,200 >> So basically, it's lm parentheses, the first item 362 00:18:58,200 --> 00:19:06,730 is the variable-- sorry, the dependent variable, tilde, x1 plus x2 plus 363 00:19:06,730 --> 00:19:10,910 however many independent variables you have. 364 00:19:10,910 --> 00:19:14,240 And then these can either be vectors, all of the same length. 365 00:19:14,240 --> 00:19:16,220 Or they can be column headers in a data frame 366 00:19:16,220 --> 00:19:18,553 that you just specify in the second argument, data frame. 
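The `lm` call described above can be sketched in both forms: with free-standing vectors, and with column headers plus a data frame. The numbers are invented for illustration and were chosen so that the fitted coefficients come out close to 1, 2, and 3.

```r
# Made-up data: y is roughly 1 + 2*x1 + 3*x2 plus a little noise.
x1 <- c(1, 2, 3, 4, 5, 6)
x2 <- c(2, 1, 4, 3, 6, 5)
noise <- c(0.1, -0.1, 0.05, -0.05, 0.1, -0.1)
y <- 1 + 2 * x1 + 3 * x2 + noise

# Form 1: the formula refers to vectors of equal length.
model_vec <- lm(y ~ x1 + x2)

# Form 2: the formula refers to column headers of a data frame,
# which is passed as the second argument.
df <- data.frame(y = y, x1 = x1, x2 = x2)
model_df <- lm(y ~ x1 + x2, data = df)

# Fitted intercept and slopes.
print(coef(model_df))
```

Both calls fit the same model, so the two coefficient vectors agree.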
367 00:19:18,553 --> 00:19:23,270 368 00:19:23,270 --> 00:19:26,380 >> You can also specify more complex formulas 369 00:19:26,380 --> 00:19:31,990 so you don't have to linearly regress one dependent variable, 370 00:19:31,990 --> 00:19:34,440 or one vector, on a previously existing vector. 371 00:19:34,440 --> 00:19:38,070 You can do, for example, a vector y squared plus 1 372 00:19:38,070 --> 00:19:42,100 and regress that against the log of some other vector. 373 00:19:42,100 --> 00:19:45,200 You can print a summary of the model with this command called 374 00:19:45,200 --> 00:19:48,607 summary-- just summary parens model. 375 00:19:48,607 --> 00:19:50,190 Again, another thing I should clarify. 376 00:19:50,190 --> 00:19:55,407 377 00:19:55,407 --> 00:19:58,615 Another thing that'll get corrected when the slides go on the internet. 378 00:19:58,615 --> 00:20:01,127 379 00:20:01,127 --> 00:20:03,210 If you just want to calculate simple correlations 380 00:20:03,210 --> 00:20:09,170 you can use cor, vector 1, vector 2, the basic function. 381 00:20:09,170 --> 00:20:11,856 The method is by default Pearson correlations. 382 00:20:11,856 --> 00:20:13,480 Those are the standard ones you can do. 383 00:20:13,480 --> 00:20:15,990 There are also Spearman and Kendall correlations 384 00:20:15,990 --> 00:20:19,530 which are some sort of rank-order correlation. 385 00:20:19,530 --> 00:20:23,600 Well, they don't calculate product moments between the vectors themselves, 386 00:20:23,600 --> 00:20:28,511 but of the rank orders of the vectors. 387 00:20:28,511 --> 00:20:29,510 I'll explain that later. 388 00:20:29,510 --> 00:20:30,120 >> AUDIENCE: Quick question. 389 00:20:30,120 --> 00:20:30,360 >> CONNOR HARRIS: Sure. 390 00:20:30,360 --> 00:20:33,151 >> AUDIENCE: So when you're calculating the simple correlations, do 391 00:20:33,151 --> 00:20:37,655 you assume that there's statistical significance to the correlation? 392 00:20:37,655 --> 00:20:39,030 CONNOR HARRIS: You don't have to. 
393 00:20:39,030 --> 00:20:41,840 394 00:20:41,840 --> 00:20:43,960 lm is basically just a machine. 395 00:20:43,960 --> 00:20:47,690 It'll take in two things and it'll spit out 396 00:20:47,690 --> 00:20:49,770 the coefficients for the best fit line. 397 00:20:49,770 --> 00:20:52,310 It also reports standard errors on those coefficients. 398 00:20:52,310 --> 00:20:55,865 And it'll tell you, is the intercept statistically 399 00:20:55,865 --> 00:20:56,740 different from 0. 400 00:20:56,740 --> 00:20:59,400 Is the slope of the best fit line statistically 401 00:20:59,400 --> 00:21:01,510 different from zero, and so on. 402 00:21:01,510 --> 00:21:06,260 So it assumes nothing, I guess, is the best answer to your question. 403 00:21:06,260 --> 00:21:07,410 OK. 404 00:21:07,410 --> 00:21:14,650 >> Plotting-- so the main reason you should use R. Like, multilinear regression, 405 00:21:14,650 --> 00:21:17,320 basically every language has some facility for that. 406 00:21:17,320 --> 00:21:21,365 And honestly R's syntax for regression is a bit arcane. 407 00:21:21,365 --> 00:21:22,990 But plotting is where it really shines. 408 00:21:22,990 --> 00:21:28,090 >> The workhorse function is plot, and it takes two vectors, x and y. 409 00:21:28,090 --> 00:21:33,010 And then the ellipsis stands for a very large number of optional arguments that 410 00:21:33,010 --> 00:21:39,190 control everything from titles to the colors of various lines or various regions, 411 00:21:39,190 --> 00:21:40,200 and the type of plot. 412 00:21:40,200 --> 00:21:42,250 You can have scatter plots or line plots. 413 00:21:42,250 --> 00:21:47,900 414 00:21:47,900 --> 00:21:49,710 >> [INAUDIBLE] 2 vectors of the same length. 415 00:21:49,710 --> 00:21:53,780 You can precede this by attaching a data frame in your script. 416 00:21:53,780 --> 00:22:01,220 And this lets you just use column headers instead of separate variables. 417 00:22:01,220 --> 00:22:05,410 You can add best fit lines and local regression curves to your graph.
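A minimal sketch of the plotting calls described above, under some assumptions: the data frame `d` is invented for illustration, `with()` is used here in place of attaching the data frame, and `lowess()` stands in for "local regression curve" since the talk does not say which smoother the slide shows.

```r
# Hypothetical data frame with two columns of the same length.
d <- data.frame(x = 1:20)
d$y <- 3 + 0.5 * d$x + sin(d$x)

pdf(NULL)  # null device so the sketch runs without a window; omit interactively

# with() lets the call refer to the column headers directly.
with(d, plot(x, y, type = "p", main = "y against x",
             xlab = "x", ylab = "y", col = "blue"))

fit <- lm(y ~ x, data = d)
abline(fit)                            # best fit line from the model
lines(lowess(d$x, d$y), col = "red")   # a local regression curve

invisible(dev.off())
```

Everything after `x` and `y` in the `plot()` call is one of the optional keyword arguments; leaving them all out still gives a usable scatter plot.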
418 00:22:05,410 --> 00:22:09,390 >> Those commands are listed here, abline and lines. 419 00:22:09,390 --> 00:22:11,640 By default these get written in pop-up windows 420 00:22:11,640 --> 00:22:15,560 because it's assumed that you're using R interactively. 421 00:22:15,560 --> 00:22:17,310 If you're not, you can write to files, which 422 00:22:17,310 --> 00:22:21,600 can be in really any format you'd like. 423 00:22:21,600 --> 00:22:25,410 Sorry, I have a typo I just noticed. 424 00:22:25,410 --> 00:22:30,887 425 00:22:30,887 --> 00:22:32,720 If you want to open another graphical device 426 00:22:32,720 --> 00:22:39,200 you can use these functions called png or jpeg or a lot of other image formats. 427 00:22:39,200 --> 00:22:42,319 And you can write the graph to whatever file name you specify. 428 00:22:42,319 --> 00:22:45,110 To close that you have to use-- I didn't write this on the slide-- 429 00:22:45,110 --> 00:22:49,650 but there's a function called dev dot off that takes no arguments. 430 00:22:49,650 --> 00:22:51,517 >> Then there are facilities for 3D plotting 431 00:22:51,517 --> 00:22:53,350 and for contour plotting if you want to make 432 00:22:53,350 --> 00:22:55,700 graphs of two independent variables. 433 00:22:55,700 --> 00:22:57,150 I won't get into these right now. 434 00:22:57,150 --> 00:22:59,130 >> There are also some facilities for animation; 435 00:22:59,130 --> 00:23:01,300 those are usually provided by third parties. 436 00:23:01,300 --> 00:23:06,330 I've made animations with R graphs, but I haven't used these third-party 437 00:23:06,330 --> 00:23:06,940 libraries. 438 00:23:06,940 --> 00:23:09,929 So I can't attest to how good they are.
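Writing a graph to a file instead of a pop-up window, as described above, might look like the sketch below. The temporary file path and the fallback to `pdf()` are assumptions added so the example runs on R builds without PNG support; the talk only mentions `png`, `jpeg` and `dev.off`.

```r
use_png <- capabilities("png")   # PNG support depends on how R was built
out <- tempfile(fileext = if (use_png) ".png" else ".pdf")

if (use_png) png(out) else pdf(out)   # open a file-based graphics device
plot(1:10, (1:10)^2, type = "l", main = "Written to a file")
invisible(dev.off())                  # close the device so the file is complete

written <- file.exists(out)
print(out)
```

Until `dev.off()` is called the file may be empty or incomplete, which is why the call matters even though it takes no arguments here.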
439 00:23:09,929 --> 00:23:12,220 What I'd recommend if you want to make animations using R 440 00:23:12,220 --> 00:23:16,480 is you can write out all of the frames for the animation 441 00:23:16,480 --> 00:23:18,470 and then you can use a third-party program-- 442 00:23:18,470 --> 00:23:23,630 the usual ones are called FFmpeg or ImageMagick-- to stitch 443 00:23:23,630 --> 00:23:26,540 all of your frames into one animation. 444 00:23:26,540 --> 00:23:28,380 >> So time for a demo. 445 00:23:28,380 --> 00:23:31,030 446 00:23:31,030 --> 00:23:37,189 So if you're using any Unix-like system, which is Linux, BSD-- but who uses BSD-- 447 00:23:37,189 --> 00:23:39,730 OS X, open a terminal window and type R at the command prompt. 448 00:23:39,730 --> 00:23:42,820 If you have RStudio or the like, that also works. 449 00:23:42,820 --> 00:23:46,270 For Windows users, you should be able to find R in your Start menu. 450 00:23:46,270 --> 00:23:50,390 It should be called something like R x64 3 point whatever. 451 00:23:50,390 --> 00:23:53,110 Open that up. 452 00:23:53,110 --> 00:23:58,850 >> So now let me just open a terminal window. 453 00:23:58,850 --> 00:24:02,562 All right, search. 454 00:24:02,562 --> 00:24:03,520 AUDIENCE: Command-Space. 455 00:24:03,520 --> 00:24:06,675 CONNOR HARRIS: Command-Space, thank you. 456 00:24:06,675 --> 00:24:10,030 I don't usually use Macs. 457 00:24:10,030 --> 00:24:13,310 Terminal, show new window. 458 00:24:13,310 --> 00:24:18,120 New window with the basic settings, R. So you should get 459 00:24:18,120 --> 00:24:22,230 a welcome message, something like this. 460 00:24:22,230 --> 00:24:31,060 >> So I'm using R interactively. 461 00:24:31,060 --> 00:24:32,719 You can also write R scripts, of course.
462 00:24:32,719 --> 00:24:34,510 Basically scripts run the exact same way as 463 00:24:34,510 --> 00:24:40,250 if you were sitting at the computer typing in each line one at a time. 464 00:24:40,250 --> 00:24:42,660 So let's start by making a vector. 465 00:24:42,660 --> 00:24:46,230 a, arrow, c 1, 2. 466 00:24:46,230 --> 00:24:49,400 1, 2, 4. 467 00:24:49,400 --> 00:24:50,050 OK, sure. 468 00:24:50,050 --> 00:24:51,630 I can't make the font size bigger. 469 00:24:51,630 --> 00:24:53,030 >> AUDIENCE: Command-Plus. 470 00:24:53,030 --> 00:24:53,650 >> CONNOR HARRIS: Command-Plus. 471 00:24:53,650 --> 00:24:54,191 Command-Plus. 472 00:24:54,191 --> 00:24:57,610 473 00:24:57,610 --> 00:25:00,370 OK, how's that? 474 00:25:00,370 --> 00:25:00,870 Good? 475 00:25:00,870 --> 00:25:01,551 OK. 476 00:25:01,551 --> 00:25:03,300 So let's start by declaring a vector. 477 00:25:03,300 --> 00:25:08,710 Do a, arrow, c 1, 2, 4. 478 00:25:08,710 --> 00:25:11,181 We can see a. 479 00:25:11,181 --> 00:25:12,680 Don't worry about the bracket there. 480 00:25:12,680 --> 00:25:18,590 The bracket is so if you print out very long arrays, you can tell where you are. 481 00:25:18,590 --> 00:25:26,987 One example would be if I just want the range 2 to 200. 482 00:25:26,987 --> 00:25:28,820 If I print a very long array, the bracket 483 00:25:28,820 --> 00:25:31,060 is just so I can keep track of which index 484 00:25:31,060 --> 00:25:33,250 we're at if I'm looking through this visually. 485 00:25:33,250 --> 00:25:36,570 486 00:25:36,570 --> 00:25:38,280 So anyway, we have a. 487 00:25:38,280 --> 00:25:43,326 >> So I said before that arrays interact very nicely with, for example, 488 00:25:43,326 --> 00:25:44,450 unary operations like this. 489 00:25:44,450 --> 00:25:46,500 So what do you think I'll get if I type a plus 1? 490 00:25:46,500 --> 00:25:49,630 491 00:25:49,630 --> 00:25:51,140 Yep.
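The commands typed so far in the demo can be reproduced roughly as below. The variable names `a_plus_one` and `long` are added here for illustration; in the interactive session the results are simply printed.

```r
a <- c(1, 2, 4)       # c() combines its arguments into a vector
print(a)              # the leading [1] is the index of the first element shown

a_plus_one <- a + 1   # the 1 is added to every element
print(a_plus_one)     # 2 3 5

long <- 2:200         # each printed row starts with the index of its first element
print(long)
```

The bracketed numbers at the start of each printed row are not part of the data; they only say which position in the vector that row begins at.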
492 00:25:51,140 --> 00:25:54,250 Right, now I'll make a different array. 493 00:25:54,250 --> 00:26:01,650 Let's say b gets c 20, 40, 80. 494 00:26:01,650 --> 00:26:03,400 So what do you think this command will do? 495 00:26:03,400 --> 00:26:09,962 496 00:26:09,962 --> 00:26:10,670 Add the elements. 497 00:26:10,670 --> 00:26:14,950 And so basically that's what it does. 498 00:26:14,950 --> 00:26:16,740 So this is pretty simple. 499 00:26:16,740 --> 00:26:23,800 So how about I do this. c is, let's say, 6 times 1 to 10. 500 00:26:23,800 --> 00:26:26,789 501 00:26:26,789 --> 00:26:28,830 So what do I expect c to contain, do you think? 502 00:26:28,830 --> 00:26:37,110 503 00:26:37,110 --> 00:26:38,110 So all the multiples of six. 504 00:26:38,110 --> 00:26:42,170 Now, what do you think will happen if I do this? 505 00:26:42,170 --> 00:26:48,090 I'll make this a bit clearer, a, c. 506 00:26:48,090 --> 00:26:50,365 So what happens, do you think, if I do this? 507 00:26:50,365 --> 00:26:51,488 a plus c. 508 00:26:51,488 --> 00:26:55,550 509 00:26:55,550 --> 00:26:56,050 [INAUDIBLE] 510 00:26:56,050 --> 00:26:58,552 511 00:26:58,552 --> 00:27:02,350 >> AUDIENCE: Either an error or it just adds the first three elements. 512 00:27:02,350 --> 00:27:04,510 >> CONNOR HARRIS: Not quite. 513 00:27:04,510 --> 00:27:05,522 This is what we got. 514 00:27:05,522 --> 00:27:08,910 What happens is the shorter array, a, got cycled. 515 00:27:08,910 --> 00:27:13,990 So we got 1, 2, 4, 1, 2, 4, 1, 2, 4. 516 00:27:13,990 --> 00:27:15,710 Yeah. 517 00:27:15,710 --> 00:27:18,940 And basically, you can see the behavior from before, a plus 1, 518 00:27:18,940 --> 00:27:22,190 as a kind of this behavior, where the shorter array is just the number 519 00:27:22,190 --> 00:27:25,410 1, which is a one-element array.
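The cycling behaviour just demonstrated can be checked with a short sketch. The name `sixes` is used here instead of the demo's `c` purely to avoid confusion with the `c()` function, and `suppressWarnings()` is an addition: R still computes the result but warns because 10 is not a multiple of 3.

```r
a <- c(1, 2, 4)
b <- c(20, 40, 80)
a_plus_b <- a + b        # equal lengths: added element by element

sixes <- 6 * 1:10        # the multiples of six; 1:10 is evaluated before the *

# a has 3 elements and sixes has 10, so a is cycled: 1 2 4 1 2 4 1 2 4 1.
a_plus_sixes <- suppressWarnings(a + sixes)

print(a_plus_b)          # 21 42 84
print(a_plus_sixes)      # 7 14 22 25 32 40 43 50 58 61
```

Adding a single number, as in `a + 1`, is the same rule with a one-element vector being cycled.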
520 00:27:25,410 --> 00:27:27,740 I should just be saying vector all the time instead of array, 521 00:27:27,740 --> 00:27:30,290 because that's what the R documentation usually does. 522 00:27:30,290 --> 00:27:33,070 It's an ingrained C habit. 523 00:27:33,070 --> 00:27:37,590 >> OK, and so now we have this array. 524 00:27:37,590 --> 00:27:38,830 So we have this array, c. 525 00:27:38,830 --> 00:27:41,380 We can get summary statistics on c, summary c. 526 00:27:41,380 --> 00:27:46,920 527 00:27:46,920 --> 00:27:48,280 And that's nice. 528 00:27:48,280 --> 00:27:51,070 529 00:27:51,070 --> 00:27:52,670 So now let's do some matrix stuff. 530 00:27:52,670 --> 00:27:56,160 Let's say m is a matrix. 531 00:27:56,160 --> 00:27:57,780 Let's make it a three by three one. 532 00:27:57,780 --> 00:28:01,630 So nrows equals 3, and ncols equals 3. 533 00:28:01,630 --> 00:28:04,190 534 00:28:04,190 --> 00:28:10,710 And for data let's do-- so what do you think this is going to do? 535 00:28:10,710 --> 00:28:15,310 536 00:28:15,310 --> 00:28:16,580 >> Right, it's the singular one. 537 00:28:16,580 --> 00:28:17,970 It's nrow and ncol. 538 00:28:17,970 --> 00:28:22,164 539 00:28:22,164 --> 00:28:24,580 So what I've done is I've declared a three by three matrix 540 00:28:24,580 --> 00:28:26,950 and I've passed in a nine-element array. 541 00:28:26,950 --> 00:28:30,530 So the logarithm of all the elements one through nine. 542 00:28:30,530 --> 00:28:33,400 543 00:28:33,400 --> 00:28:37,285 And the values of those all fill up the, sorry, array--? 544 00:28:37,285 --> 00:28:38,660 AUDIENCE: Are those base 10 logs? 545 00:28:38,660 --> 00:28:41,284 CONNOR HARRIS: No, log is natural logarithms, so base e. 546 00:28:41,284 --> 00:28:44,886 547 00:28:44,886 --> 00:28:47,010 Yeah, if you wanted a base 10 log, I think you'd have to 548 00:28:47,010 --> 00:28:51,620 do log of whatever, divided by log 10.
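A short sketch of the matrix declaration and the logarithm question above. The use of `log10()` is an addition not mentioned in the talk; it is a base R function that gives the same result as dividing by `log(10)`.

```r
# A 3 by 3 matrix filled with the natural logs of 1 through 9.
m <- matrix(log(1:9), nrow = 3, ncol = 3)
print(m)

# Data fill down each column first, so column 1 holds log(1), log(2), log(3).
first_column <- m[, 1]

# Base-10 logs: divide by log(10), or use the built-in log10().
by_division <- log(1000) / log(10)
built_in    <- log10(1000)
```

Note that the argument names are the singular `nrow` and `ncol`, as the speaker corrects himself to say.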
549 00:28:51,620 --> 00:28:56,750 And so the data [INAUDIBLE] just fills the columns, so top to bottom, 550 00:28:56,750 --> 00:28:59,490 then left to right. 551 00:28:59,490 --> 00:29:06,890 And if you wanted to make some other array, let's say n is a matrix. 552 00:29:06,890 --> 00:29:10,317 Let's do, I don't know, 2 to 13. 553 00:29:10,317 --> 00:29:11,900 Or I'll do something more interesting. 554 00:29:11,900 --> 00:29:13,770 I'll do 2 to 4. 555 00:29:13,770 --> 00:29:15,780 nrow equals, let's say, 3. 556 00:29:15,780 --> 00:29:18,992 ncol equals 4. 557 00:29:18,992 --> 00:29:20,360 n. 558 00:29:20,360 --> 00:29:22,090 So we've got this. 559 00:29:22,090 --> 00:29:26,130 >> And now if we want to multiply these, we want to do m, percent times percent, n, 560 00:29:26,130 --> 00:29:27,680 because that's n. 561 00:29:27,680 --> 00:29:30,234 562 00:29:30,234 --> 00:29:31,400 And we have the matrix product. 563 00:29:31,400 --> 00:29:33,970 564 00:29:33,970 --> 00:29:37,810 By the way, did you see how when I declared n, the 2 to 4 565 00:29:37,810 --> 00:29:43,570 vector got cycled until it filled up all of n? 566 00:29:43,570 --> 00:29:45,710 If you wanted to take an eigenvalue decomposition, 567 00:29:45,710 --> 00:29:46,960 this is something we can do very easily. 568 00:29:46,960 --> 00:29:47,709 We can do eigen m. 569 00:29:47,709 --> 00:29:52,290 570 00:29:52,290 --> 00:29:54,600 And so this is our first encounter with lists. 571 00:29:54,600 --> 00:29:57,000 >> So eigen m is a list with two keys. 572 00:29:57,000 --> 00:29:58,430 Values, which is this array here. 573 00:29:58,430 --> 00:30:01,030 And vectors, which is this array here. 574 00:30:01,030 --> 00:30:08,240 So if you wanted to extract, say, this third column 575 00:30:08,240 --> 00:30:13,080 from the eigenvectors matrix, because eigenvectors are column vectors.
576 00:30:13,080 --> 00:30:24,400 So we can do vec gets eigen m, dollar sign, vectors, comma 3, [INAUDIBLE]. 577 00:30:24,400 --> 00:30:29,800 578 00:30:29,800 --> 00:30:30,900 vec. 579 00:30:30,900 --> 00:30:34,100 It's that, as you'd expect. 580 00:30:34,100 --> 00:30:39,210 >> Then say m, percent times percent, vec. 581 00:30:39,210 --> 00:30:42,610 582 00:30:42,610 --> 00:30:48,320 So the result here sure looks like we took the third eigenvalue here, 583 00:30:48,320 --> 00:30:50,390 which corresponds to the third eigenvector. 584 00:30:50,390 --> 00:30:53,190 It just multiplies everything in this eigenvector, component-wise, 585 00:30:53,190 --> 00:30:53,990 by the eigenvalue. 586 00:30:53,990 --> 00:30:57,760 And that's what we'd expect, because that's what eigenvalues are. 587 00:30:57,760 --> 00:31:00,890 Is there anyone here who hasn't taken linear algebra? 588 00:31:00,890 --> 00:31:02,530 A few people, OK. 589 00:31:02,530 --> 00:31:04,030 Just turn your mind off for a bit. 590 00:31:04,030 --> 00:31:07,490 591 00:31:07,490 --> 00:31:20,720 And sure enough if we take eigen m, dollar sign, values, 3, times vec, 592 00:31:20,720 --> 00:31:21,810 we get the same thing. 593 00:31:21,810 --> 00:31:24,726 It's laid out differently, as a row vector instead of a column vector, 594 00:31:24,726 --> 00:31:25,640 but big deal. 595 00:31:25,640 --> 00:31:29,430 596 00:31:29,430 --> 00:31:35,170 And so those are basically the nice things that we can do with matrices, 597 00:31:35,170 --> 00:31:36,489 demonstrating lists. 598 00:31:36,489 --> 00:31:39,030 I should show the nice things about functions too. 599 00:31:39,030 --> 00:31:41,750 >> So let's say-- [INAUDIBLE] function, let's call 600 00:31:41,750 --> 00:31:51,960 it func gets function n, n squared-- actually, that's not really the best. 601 00:31:51,960 --> 00:31:55,632 a, b, a squared plus b.
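The matrix product and eigenvalue check demonstrated above can be sketched as follows. One assumption is labeled loudly: the symmetric matrix `s` is invented for this example so that the eigenvalues are guaranteed to be real; the demo applies `eigen` to its own matrix. `eigen()` needs a square matrix, so it would not work on the 3 by 4 matrix `n`.

```r
m <- matrix(log(1:9), nrow = 3, ncol = 3)
n <- matrix(2:4, nrow = 3, ncol = 4)   # 2, 3, 4 is cycled to fill all 12 cells
product <- m %*% n                     # 3x3 times 3x4 gives a 3x4 matrix

# Hypothetical symmetric matrix, chosen so the eigenvalues are real.
s <- matrix(c(2, 1, 0,
              1, 2, 1,
              0, 1, 2), nrow = 3)
e <- eigen(s)             # a list with components "values" and "vectors"
vec <- e$vectors[, 3]     # third eigenvector (eigenvectors are the columns)

lhs <- as.vector(s %*% vec)   # matrix times eigenvector
rhs <- e$values[3] * vec      # eigenvalue times eigenvector
print(rbind(lhs, rhs))        # the two rows agree
```

`s %*% vec` comes back as a one-column matrix while `e$values[3] * vec` is a plain vector, which is the layout difference the speaker points out.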
602 00:31:55,632 --> 00:31:58,547 603 00:31:58,547 --> 00:32:00,380 So, one thing about functions, again, is they 604 00:32:00,380 --> 00:32:01,963 don't need explicit return statements. 605 00:32:01,963 --> 00:32:04,250 So you can just-- the last statement evaluated 606 00:32:04,250 --> 00:32:07,502 will be the returned statement, or the returned value. 607 00:32:07,502 --> 00:32:10,460 So in this case, we're evaluating only one statement, a squared plus b. 608 00:32:10,460 --> 00:32:12,043 That'll be the default return value. 609 00:32:12,043 --> 00:32:14,530 It never hurts to put in return values explicitly, 610 00:32:14,530 --> 00:32:16,880 especially if you're dealing with a function with very complicated logic 611 00:32:16,880 --> 00:32:17,380 inside it. 612 00:32:17,380 --> 00:32:18,450 But you don't need them. 613 00:32:18,450 --> 00:32:24,890 So now we can do func 5, 1, and this is basically what you'd expect. 614 00:32:24,890 --> 00:32:29,146 615 00:32:29,146 --> 00:32:31,270 Another thing we can do, we can actually do func b 616 00:32:31,270 --> 00:32:33,260 equals 1, a equals 5. 617 00:32:33,260 --> 00:32:36,870 618 00:32:36,870 --> 00:32:40,770 So if we specify by name here which value goes to which argument 619 00:32:40,770 --> 00:32:44,680 in the function, we can flip these values around however we want. 620 00:32:44,680 --> 00:32:48,405 >> AUDIENCE: Is there a reason to write it out with b 621 00:32:48,405 --> 00:32:52,404 equals as opposed to just using the numbers and a comma? 622 00:32:52,404 --> 00:32:54,820 CONNOR HARRIS: Yeah, you usually do this if you have functions 623 00:32:54,820 --> 00:32:58,540 with a lot of arguments. 624 00:32:58,540 --> 00:33:00,690 Which can often be like flags that you'd only 625 00:33:00,690 --> 00:33:03,130 want to use on rare occasions.
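The function calls from the demo can be written out roughly like this. `func2` is a name introduced here for illustration; in the session the speaker goes on to redefine `func` itself with a default value for `b`.

```r
# The value of the last expression evaluated is what the function returns.
func <- function(a, b) a^2 + b

by_position <- func(5, 1)          # 26
by_name     <- func(b = 1, a = 5)  # named arguments can come in any order: 26

# Same body with an explicit return() and a default value for b.
func2 <- function(a, b = 2) {
  return(a^2 + b)
}
with_b    <- func2(4, 1)   # 17
default_b <- func2(4)      # b falls back to 2, giving 18
```

Naming arguments pays off most for functions with many rarely used options, since only the ones that differ from their defaults have to be written.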
626 00:33:03,130 --> 00:33:06,740 And this way you can only-- you can specify the particular arguments 627 00:33:06,740 --> 00:33:09,110 that you want to use non-default values for, 628 00:33:09,110 --> 00:33:14,470 and you don't have to write a bunch of flag equals false after them. 629 00:33:14,470 --> 00:33:19,710 Or I can write this again with a default value like b equals 2. 630 00:33:19,710 --> 00:33:26,289 And then I can do func, I'll do 4, 1 this time. 631 00:33:26,289 --> 00:33:28,580 And 17, which is 4 squared plus 1, as you'd expect. 632 00:33:28,580 --> 00:33:34,290 >> But I can also just call this with func 4, 633 00:33:34,290 --> 00:33:36,970 and I'll get 18, because I haven't specified b. 634 00:33:36,970 --> 00:33:38,550 So b gets the default value of 2. 635 00:33:38,550 --> 00:33:41,700 636 00:33:41,700 --> 00:33:47,200 >> OK, so now if you're following along with the demo, 637 00:33:47,200 --> 00:33:51,010 type this line into your command prompt and see what comes up. 638 00:33:51,010 --> 00:33:52,090 Actually, do this. 639 00:33:52,090 --> 00:33:52,590 Type this. 640 00:33:52,590 --> 00:33:57,780 641 00:33:57,780 --> 00:34:01,000 You should get something like this. 642 00:34:01,000 --> 00:34:04,780 So mtcars is a built-in data set for these demonstration 643 00:34:04,780 --> 00:34:13,550 purposes that comes with-- that comes in with your base R distribution. 644 00:34:13,550 --> 00:34:19,211 This is a collection of data from a 1974 issue of the magazine Motor Trend 645 00:34:19,211 --> 00:34:20,710 on a number of different car models. 646 00:34:20,710 --> 00:34:28,270 >> So there's miles per gallon, cylinders-- I forget what disp is-- horsepower. 647 00:34:28,270 --> 00:34:31,610 648 00:34:31,610 --> 00:34:32,420 Probably.
649 00:34:32,420 --> 00:34:36,920 If you just Google mtcars, then one of the first results 650 00:34:36,920 --> 00:34:38,730 will be from the official R documentation 651 00:34:38,730 --> 00:34:41,080 and it'll explain all these data fields. 652 00:34:41,080 --> 00:34:47,020 So weight is-- wt is the weight of the car in tons. 653 00:34:47,020 --> 00:34:48,880 qsec is the quarter mile time. 654 00:34:48,880 --> 00:34:52,409 655 00:34:52,409 --> 00:34:55,850 So now we can do some fun things with mtcars as a data frame. 656 00:34:55,850 --> 00:35:01,640 >> So we can do things like rownames, mtcars. 657 00:35:01,640 --> 00:35:05,490 And this is a list of all the rows in the data set, which are the names of cars. 658 00:35:05,490 --> 00:35:10,780 We can do colnames, mtcars, this. 659 00:35:10,780 --> 00:35:15,500 If you do mtcars, subscript a numeric index, like 2, 660 00:35:15,500 --> 00:35:18,177 we get the second column from this, which would be cylinders. 661 00:35:18,177 --> 00:35:19,370 >> AUDIENCE: What did you do? 662 00:35:19,370 --> 00:35:21,570 >> CONNOR HARRIS: I typed mtcars, bracket 2, 663 00:35:21,570 --> 00:35:24,180 which gave me the second column out of mtcars. 664 00:35:24,180 --> 00:35:34,501 665 00:35:34,501 --> 00:35:38,110 Or if we want a row, I can type mtcars, comma 2, for example. 666 00:35:38,110 --> 00:35:41,850 667 00:35:41,850 --> 00:35:46,390 Other way around, 2 comma, like that. 668 00:35:46,390 --> 00:35:48,880 And that gets you your row. 669 00:35:48,880 --> 00:35:54,680 This here just gives the column, but the column as a vector. 670 00:35:54,680 --> 00:36:04,634 671 00:36:04,634 --> 00:36:06,425 I just realized now I forgot to show 672 00:36:06,425 --> 00:36:09,150 some of the cool things about vectors that you can do with indices. 673 00:36:09,150 --> 00:36:10,480 So let me do that right now.
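The three ways of indexing the data frame shown above can be compared side by side in a small sketch. The variable names on the left are added for illustration only. One factual note: the dataset's documentation lists `wt` in units of 1000 lb.

```r
car_names <- rownames(mtcars)   # one row per car model
fields    <- colnames(mtcars)   # the data fields, e.g. mpg, cyl, disp, hp

second_col_df  <- mtcars[2]     # single index: a one-column data frame
second_col_vec <- mtcars[, 2]   # empty row slot: that column as a plain vector
second_row     <- mtcars[2, ]   # empty column slot: the second row

print(second_row)
```

The position of the comma decides whether the number selects a row or a column, which is the slip the speaker corrects in the demo.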
674 00:36:10,480 --> 00:36:17,130 So let's do c gets-- put this on pause-- 2 times 1 through 10. 675 00:36:17,130 --> 00:36:21,360 So c is just going to be the vector 2 through 20. 676 00:36:21,360 --> 00:36:24,640 I can take elements like this, c 2. 677 00:36:24,640 --> 00:36:30,942 I can pass in a vector like this, c-- let me 678 00:36:30,942 --> 00:36:34,470 use a different name than c, like vec. 679 00:36:34,470 --> 00:36:37,591 680 00:36:37,591 --> 00:36:39,340 Basically, I'm doing this so you don't get 681 00:36:39,340 --> 00:36:45,010 confused between c as the vector construction function, 682 00:36:45,010 --> 00:36:48,800 and then c as a variable name. 683 00:36:48,800 --> 00:36:53,120 vec, bracket, c 4, 5, 7. 684 00:36:53,120 --> 00:36:56,540 This will get me out the fourth, fifth, and seventh elements of the array. 685 00:36:56,540 --> 00:37:01,740 I can do vec, put in a negative index, like negative 4. 686 00:37:01,740 --> 00:37:06,500 That gets me out this with the fourth element removed. 687 00:37:06,500 --> 00:37:10,140 Then if I wanted to do slices, I can do vec 2 to 6. 688 00:37:10,140 --> 00:37:15,480 2 colon 6 is just another vector, which is 2, 3, 4, 5, 6. 689 00:37:15,480 --> 00:37:18,230 Pass that in. 690 00:37:18,230 --> 00:37:20,770 >> So anyway, back to mtcars. 691 00:37:20,770 --> 00:37:26,650 692 00:37:26,650 --> 00:37:28,450 So let's do some regressions. 693 00:37:28,450 --> 00:37:34,240 Let's say model gets-- let's linearly regress-- I don't know. 694 00:37:34,240 --> 00:37:41,780 First let's do attach mtcars, of course. 695 00:37:41,780 --> 00:37:44,870 696 00:37:44,870 --> 00:38:00,010 So [INAUDIBLE] model lm, let's regress miles per gallon, tilde, weight. 697 00:38:00,010 --> 00:38:03,300 And then the data frame is mtcars. 698 00:38:03,300 --> 00:38:06,830 So summary model.
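The vector indexing forms shown at the start of this passage, gathered into one sketch. The name `vec` follows the reconstruction of the demo; the names on the left are added for illustration.

```r
vec <- 2 * 1:10               # 2, 4, ..., 20

second  <- vec[2]             # a single element: 4
picked  <- vec[c(4, 5, 7)]    # index with a vector of positions: 8 10 14
dropped <- vec[-4]            # negative index leaves that position out
sliced  <- vec[2:6]           # 2:6 is itself the vector 2, 3, 4, 5, 6

print(dropped)                # 2 4 6 10 12 14 16 18 20
```

R indices start at 1, so `vec[2]` is the second element rather than the third.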
699 00:38:06,830 --> 00:38:12,900 700 00:38:12,900 --> 00:38:15,595 >> OK, so this looks a bit complicated. 701 00:38:15,595 --> 00:38:19,380 But basically, if we try to express miles per gallon 702 00:38:19,380 --> 00:38:23,970 as a linear function of weight, then we get this line here, 703 00:38:23,970 --> 00:38:28,730 which intercepts at 37.28. 704 00:38:28,730 --> 00:38:33,830 37.28 would be the theoretical miles per gallon of a car that weighs zero. 705 00:38:33,830 --> 00:38:41,210 And then for every additional ton, you knock five miles per gallon 706 00:38:41,210 --> 00:38:42,440 off of that. 707 00:38:42,440 --> 00:38:45,120 Both of these coefficients, you can see, have standard errors there. 708 00:38:45,120 --> 00:38:47,870 And they're very statistically significant. 709 00:38:47,870 --> 00:38:55,740 >> So we can be very sure, to 1 e negative 10. 710 00:38:55,740 --> 00:38:59,510 So 1 times 10 to the negative 10, that if you make a car heavier, 711 00:38:59,510 --> 00:39:01,440 it'll have worse miles per gallon. 712 00:39:01,440 --> 00:39:04,940 713 00:39:04,940 --> 00:39:07,250 Or we can test some other model. 714 00:39:07,250 --> 00:39:09,230 Like instead of regressing this on weight, 715 00:39:09,230 --> 00:39:12,600 let's regress on the log of weight, because maybe the effect of weight 716 00:39:12,600 --> 00:39:15,690 on mileage is somehow not linear. 717 00:39:15,690 --> 00:39:18,540 >> This gave us an r squared of 0.7528. 718 00:39:18,540 --> 00:39:19,610 So let's try this. 719 00:39:19,610 --> 00:39:21,485 This time let's use a different variable, too. 720 00:39:21,485 --> 00:39:22,500 model2. 721 00:39:22,500 --> 00:39:24,800 So summary, model2. 722 00:39:24,800 --> 00:39:28,200 723 00:39:28,200 --> 00:39:31,390 All right, so again, we got our best fit line here.
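The first regression discussed above can be reproduced with the sketch below. It passes `data = mtcars` rather than attaching the data frame, which is a substitution made here so the example has no side effects; the extracted names `intercept`, `slope` and `r_squared` are added for illustration.

```r
# Miles per gallon as a linear function of weight.
model <- lm(mpg ~ wt, data = mtcars)
print(summary(model))   # coefficients, standard errors, p-values, R squared

intercept <- unname(coef(model)["(Intercept)"])   # about 37.29
slope     <- unname(coef(model)["wt"])            # about -5.34 mpg per unit of wt
r_squared <- summary(model)$r.squared             # about 0.7528
```

Since `wt` is recorded in units of 1000 lb, the slope is the change in miles per gallon for each additional 1000 lb.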
724 00:39:31,390 --> 00:39:36,160 And this time-- this is saying, basically, that every time 725 00:39:36,160 --> 00:39:38,090 you increase the weight of the car by a factor of e 726 00:39:38,090 --> 00:39:40,580 you lose this many miles per gallon. 727 00:39:40,580 --> 00:39:43,210 728 00:39:43,210 --> 00:39:50,326 >> And so this time our residual standard error is-- that doesn't matter, actually. 729 00:39:50,326 --> 00:39:53,540 The residual standard error is basically just the standard error 730 00:39:53,540 --> 00:39:57,760 that you have left after taking out the linear trend. 731 00:39:57,760 --> 00:40:02,805 And our r squared here is 0.81, which is a bit better than what 732 00:40:02,805 --> 00:40:07,640 we had before, 0.75. 733 00:40:07,640 --> 00:40:09,750 >> And so now let's add a term to this regression. 734 00:40:09,750 --> 00:40:13,020 So let's regress miles per gallon on both the log of weight 735 00:40:13,020 --> 00:40:21,130 and, let's do, q mile, the quarter mile time. 736 00:40:21,130 --> 00:40:26,190 OK, it should be the-- OK, qsec. 737 00:40:26,190 --> 00:40:26,690 qsec. 738 00:40:26,690 --> 00:40:30,630 739 00:40:30,630 --> 00:40:35,000 Actually-- sorry, what? 740 00:40:35,000 --> 00:40:37,000 Let me call this something other than model2. 741 00:40:37,000 --> 00:40:38,000 Let me call this model3. 742 00:40:38,000 --> 00:40:40,860 743 00:40:40,860 --> 00:40:42,900 And so now we can do summary model3. 744 00:40:42,900 --> 00:40:46,850 745 00:40:46,850 --> 00:40:49,100 And so again, this is basically what you'd expect. 746 00:40:49,100 --> 00:40:51,750 You have a positive intercept. 747 00:40:51,750 --> 00:40:54,550 The coefficient on increasing weight is negative. 748 00:40:54,550 --> 00:40:58,490 And the coefficient on increasing quarter mile time 749 00:40:58,490 --> 00:41:02,420 is positive, although smaller than the one on weight. 750 00:41:02,420 --> 00:41:06,010 Now intuitively, you can make sense of this by saying think of sports cars.
751 00:41:06,010 --> 00:41:08,950 They have very fast acceleration, very short quarter mile times. 752 00:41:08,950 --> 00:41:13,729 They're also going to use more gas, whereas more sensible cars are going 753 00:41:13,729 --> 00:41:16,020 to have slower acceleration, higher quarter mile times, 754 00:41:16,020 --> 00:41:20,890 and use less gas, so higher miles per gallon. 755 00:41:20,890 --> 00:41:21,390 Great. 756 00:41:21,390 --> 00:41:23,431 And so now it's time to plot something like this. 757 00:41:23,431 --> 00:41:27,810 So let's do-- so the bare bones plot we can do-- 758 00:41:27,810 --> 00:41:35,280 because I've attached this data frame before, we can just do plot, wt, mpg. 759 00:41:35,280 --> 00:41:38,762 760 00:41:38,762 --> 00:41:39,720 Make this a bit bigger. 761 00:41:39,720 --> 00:41:55,050 762 00:41:55,050 --> 00:41:57,350 There, we basically have a scatter plot, but the points 763 00:41:57,350 --> 00:41:58,690 are kind of hard to see on this. 764 00:41:58,690 --> 00:42:04,860 765 00:42:04,860 --> 00:42:10,900 >> I don't remember offhand what the syntax is for changing the plot. 766 00:42:10,900 --> 00:42:14,100 So I guess this would be a good time to bring up, 767 00:42:14,100 --> 00:42:18,000 there's a very nice built-in help facility, help, quotes, function name. 768 00:42:18,000 --> 00:42:21,690 It'll bring up basically anything you'd like. 769 00:42:21,690 --> 00:42:28,010 770 00:42:28,010 --> 00:42:32,730 I think I'll actually make this type equals p, for point plots. 771 00:42:32,730 --> 00:42:34,369 Does that change anything? 772 00:42:34,369 --> 00:42:35,160 And no, not really. 773 00:42:35,160 --> 00:42:39,160 774 00:42:39,160 --> 00:42:39,660 OK.
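The model comparison made just before the plotting detour can be reproduced roughly as follows. As in the demo the models are named `model`, `model2` and `model3`; the vector `r2` is added here to line the R squared values up, and `data = mtcars` replaces `attach`.

```r
model  <- lm(mpg ~ wt, data = mtcars)
model2 <- lm(mpg ~ log(wt), data = mtcars)          # log of weight instead
model3 <- lm(mpg ~ log(wt) + qsec, data = mtcars)   # add quarter mile time

r2 <- c(model  = summary(model)$r.squared,
        model2 = summary(model2)$r.squared,
        model3 = summary(model3)$r.squared)
print(round(r2, 4))

# Signs of the fitted coefficients in the two-variable model.
print(coef(model3))
```

R squared can only stay the same or rise when a term is added to a model, so the comparison between `model2` and `model3` says less than the one between `model` and `model2`.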
775 00:42:39,660 --> 00:42:46,760 776 00:42:46,760 --> 00:42:49,580 >> For some reason, when I did this on my own computer a while ago, 777 00:42:49,580 --> 00:42:52,080 all the scatter points were very clear. 778 00:42:52,080 --> 00:43:06,390 779 00:43:06,390 --> 00:43:13,970 Anyway, is the scatter kind of apparent? 780 00:43:13,970 --> 00:43:15,124 There's one there. 781 00:43:15,124 --> 00:43:16,165 A few there, a few there. 782 00:43:16,165 --> 00:43:18,860 783 00:43:18,860 --> 00:43:21,185 You can kind of see them, right? 784 00:43:21,185 --> 00:43:24,310 So if we want to add a best fit line to this plot here, which is a bit bare 785 00:43:24,310 --> 00:43:29,290 bones-- let me make it a bit nicer. 786 00:43:29,290 --> 00:43:38,075 main equals-- versus weight. 787 00:43:38,075 --> 00:43:46,322 788 00:43:46,322 --> 00:43:49,740 Miles per gallon. 789 00:43:49,740 --> 00:43:53,570 Again, you can see how useful optional arguments are here, for also 790 00:43:53,570 --> 00:43:58,090 not having to put things in a particular order, with keyword arguments, 791 00:43:58,090 --> 00:44:01,600 when you have plots, because these take a lot of arguments. 792 00:44:01,600 --> 00:44:07,490 >> xlab equals weight, weight, tons. 793 00:44:07,490 --> 00:44:10,091 794 00:44:10,091 --> 00:44:10,590 OK. 795 00:44:10,590 --> 00:44:17,340 796 00:44:17,340 --> 00:44:21,480 OK, yeah, this device is being a bit annoying. 797 00:44:21,480 --> 00:44:30,160 But you can kind of see there, there's a graph title at the top. 798 00:44:30,160 --> 00:44:35,260 Up here there's-- down at the bottom here there are axis labels. 799 00:44:35,260 --> 00:44:37,700 I don't remember offhand what the command is-- 800 00:44:37,700 --> 00:44:41,000 what the function is to increase the size of those labels and titles, 801 00:44:41,000 --> 00:44:43,110 but they're there.
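For the points and labels that were hard to see in the demo, one possible approach is sketched below. It is not what was typed in the session: the title and label strings are assumptions, and `pch`, `cex`, `cex.main` and `cex.lab` are standard graphical parameters that the talk does not name.

```r
pdf(NULL)  # null device so the sketch runs anywhere; omit in an interactive session

plot(mtcars$wt, mtcars$mpg,
     main = "Mileage versus weight",        # graph title (assumed wording)
     xlab = "Weight", ylab = "Miles per gallon",
     pch = 19,         # solid circles are easier to see than the default
     cex = 1.5,        # scale the points up
     cex.main = 1.5,   # scale the title up
     cex.lab = 1.3)    # scale the axis labels up

usr <- par("usr")      # limits of the plotting region: x1, x2, y1, y2
invisible(dev.off())
```

Because every option is passed by keyword, the order of `main`, `xlab`, `pch` and the rest does not matter.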
802 00:44:43,110 --> 00:44:46,625 >> And so if we want to add a best fit line, 803 00:44:46,625 --> 00:44:49,250 we can do something like the syntax I have written here. 804 00:44:49,250 --> 00:44:52,280 805 00:44:52,280 --> 00:45:11,130 So remember we just-- our model was mpg, weight, mtcars. 806 00:45:11,130 --> 00:45:16,470 And so if I wanted to add the best fit line, I can do abline model. 807 00:45:16,470 --> 00:45:18,556 And boom, we have a best fit line. 808 00:45:18,556 --> 00:45:19,970 It's kind of hard to see again. 809 00:45:19,970 --> 00:45:22,178 I'm very sorry about the technical difficulties. 810 00:45:22,178 --> 00:45:25,230 But it runs basically from the top left to the bottom right. 811 00:45:25,230 --> 00:45:27,550 >> And if the scale were bigger, you could see 812 00:45:27,550 --> 00:45:31,260 that the intercept is what you'd get from the summary statistics 813 00:45:31,260 --> 00:45:34,790 if you type summary model. 814 00:45:34,790 --> 00:45:40,130 OK, so I hope everyone gets something of a sense of what 815 00:45:40,130 --> 00:45:42,030 R is, what it's good for. 816 00:45:42,030 --> 00:45:45,520 You can make far nicer plots than this on your own time, if you like. 817 00:45:45,520 --> 00:45:50,100 818 00:45:50,100 --> 00:45:53,950 >> So the foreign function interface. 819 00:45:53,950 --> 00:46:00,330 This is something that's not usually covered in introductory lectures 820 00:46:00,330 --> 00:46:03,560 or introductory anything for R. 821 00:46:03,560 --> 00:46:05,584 It's not likely you're going to need it. 822 00:46:05,584 --> 00:46:08,000 However, I've found it useful in my own projects in the past. 823 00:46:08,000 --> 00:46:10,984 And there are no good tutorials for it online. 824 00:46:10,984 --> 00:46:12,900 So I'm just going to rush you all through this 825 00:46:12,900 --> 00:46:16,606 and then you're free to leave.
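Adding the best fit line to the scatter plot, as done near the start of this passage, can be sketched like this. The colour and line width are assumptions added to make the line easier to see than it was in the demo.

```r
model <- lm(mpg ~ wt, data = mtcars)

pdf(NULL)  # null device; omit in an interactive session
plot(mtcars$wt, mtcars$mpg, xlab = "wt", ylab = "mpg")
abline(model, col = "red", lwd = 2)   # uses the intercept and slope stored in model
invisible(dev.off())

# Where the line would cross wt = 0, matching the intercept in summary(model).
line_at_zero <- unname(coef(model)[1])
print(line_at_zero)
```

`abline()` draws on whatever plot is currently open, so it has to come after the `plot()` call and before the device is closed.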
826 00:46:16,606 --> 00:46:18,480 And so the foreign function interface is what 827 00:46:18,480 --> 00:46:23,130 you can use to call out to C functions from R. Internally, 828 00:46:23,130 --> 00:46:29,850 R is built on C arithmetic. R's arithmetic is just C's 64-bit floating point, 829 00:46:29,850 --> 00:46:32,852 which is type double [inaudible]. 830 00:46:32,852 --> 00:46:35,060 And you might want to do this for a bunch of reasons. 831 00:46:35,060 --> 00:46:39,250 For one, R is interpreted, it's not compiled down to machine code. 832 00:46:39,250 --> 00:46:42,170 So you can rewrite your inner loops in C and then get 833 00:46:42,170 --> 00:46:45,920 the advantage of using R. Like, it's a bit more convenient than C. 834 00:46:45,920 --> 00:46:48,899 It has better graphing facilities and so on. 835 00:46:48,899 --> 00:46:51,690 And while still being able to get top speed out of the inner loops, 836 00:46:51,690 --> 00:46:53,650 which is where you really need it. 837 00:46:53,650 --> 00:46:56,330 >> Reusing existing C libraries, that's important too. 838 00:46:56,330 --> 00:47:00,320 If you have some C library for like, I don't know, Fourier transforms, 839 00:47:00,320 --> 00:47:05,190 or some very arcane statistical procedure used 840 00:47:05,190 --> 00:47:09,470 in high-energy astrophysics or something, I don't know. 841 00:47:09,470 --> 00:47:13,058 High-energy astrophysics is a thing, I think. 842 00:47:13,058 --> 00:47:16,480 But you can do that instead of having to write a native R port of them. 843 00:47:16,480 --> 00:47:22,725 And on the-- and again, if you look at a lot of the default R library, 844 00:47:22,725 --> 00:47:25,600 in the internals, the internals are going to use the foreign function 845 00:47:25,600 --> 00:47:26,724 interface very extensively.
846 00:47:26,724 --> 00:47:31,630 They'll have things like Fourier transforms or computing correlation 847 00:47:31,630 --> 00:47:34,890 coefficients written in C, and they'll just have R wrappers around them. 848 00:47:34,890 --> 00:47:38,230 The interface is a bit tricky. I think 849 00:47:38,230 --> 00:47:43,750 its difficulty is exaggerated in a lot of the instructions you'll find. 850 00:47:43,750 --> 00:47:46,200 But nonetheless, it is a little confusing. 851 00:47:46,200 --> 00:47:48,650 And I haven't been able to find a good tutorial for it, 852 00:47:48,650 --> 00:47:51,980 so this is it right now. 853 00:47:51,980 --> 00:47:55,360 Again, this whole section is more for future reference. 854 00:47:55,360 --> 00:47:57,687 Don't worry about copying everything down right now. 855 00:47:57,687 --> 00:48:00,020 So the following instructions are for Unix-likes, 856 00:48:00,020 --> 00:48:05,150 Linux, BSD, OS X. I don't know how this works on Windows, 857 00:48:05,150 --> 00:48:08,280 but please just don't do your final project on Windows. 858 00:48:08,280 --> 00:48:10,790 859 00:48:10,790 --> 00:48:12,460 You really don't want to. 860 00:48:12,460 --> 00:48:14,770 Unix is much better set up for casual programming. 861 00:48:14,770 --> 00:48:19,320 862 00:48:19,320 --> 00:48:21,390 So, basically, the foreign function interface. 863 00:48:21,390 --> 00:48:24,420 If you want to write a C function for use with R, 864 00:48:24,420 --> 00:48:27,250 it has to take all of its arguments as pointers. 865 00:48:27,250 --> 00:48:30,666 >> So for single values, this means it's a pointer to the value. 866 00:48:30,666 --> 00:48:33,040 For arrays, this is a pointer to the first element, which 867 00:48:33,040 --> 00:48:36,750 is what array names really mean. 868 00:48:36,750 --> 00:48:40,140 Again, this is something you should have pretty thoroughly down after pset five.
869 00:48:40,140 --> 00:48:43,334 Array names are just pointers to the first element. 870 00:48:43,334 --> 00:48:44,750 Floating-point types are double. 871 00:48:44,750 --> 00:48:47,310 And your function has to return void. 872 00:48:47,310 --> 00:48:50,810 The only way that it can actually tell R what happened 873 00:48:50,810 --> 00:48:54,410 is by modifying the memory that R gave it through the foreign function 874 00:48:54,410 --> 00:48:54,910 interface. 875 00:48:54,910 --> 00:48:58,180 876 00:48:58,180 --> 00:49:00,127 >> So I've written this example here, this is 877 00:49:00,127 --> 00:49:02,460 a function that computes the dot product of two vectors. 878 00:49:02,460 --> 00:49:05,060 It takes two arguments, vec1, vec2, which are the vectors themselves, 879 00:49:05,060 --> 00:49:06,934 and then n, which is the length, because again, 880 00:49:06,934 --> 00:49:12,630 R has built-in [inaudible] to figure out the length of vectors, but C doesn't. 881 00:49:12,630 --> 00:49:16,182 In C, vectors are an arbitrarily delimited chunk of memory. 882 00:49:16,182 --> 00:49:17,890 So the way you can compute the dot product 883 00:49:17,890 --> 00:49:23,470 is this: just set the out parameter to zero and then iterate through 884 00:49:23,470 --> 00:49:28,760 from 1 to star n, because n is a pointer to the length, 885 00:49:28,760 --> 00:49:32,929 just adding things to this out parameter. 886 00:49:32,929 --> 00:49:34,970 And it can be good practice, if you're going to do 887 00:49:34,970 --> 00:49:37,270 this, to write two separate C functions. 888 00:49:37,270 --> 00:49:41,970 One of them has-- one of them just takes arguments with the types 889 00:49:41,970 --> 00:49:43,970 that they would normally have in C. 890 00:49:43,970 --> 00:49:47,780 >> So it takes array arguments as pointers. 891 00:49:47,780 --> 00:49:57,090 But single-value arguments like n, it just takes as values by copy, 892 00:49:57,090 --> 00:49:57,917 without pointers.
893 00:49:57,917 --> 00:49:59,750 And then it doesn't [inaudible] out pointer. 894 00:49:59,750 --> 00:50:01,290 And then you can have a separate, basically, 895 00:50:01,290 --> 00:50:03,623 wrapper function which basically handles the requirements 896 00:50:03,623 --> 00:50:07,740 of the foreign function interface for you. 897 00:50:07,740 --> 00:50:11,840 >> The way you call this in R is, once you have your function written in C, 898 00:50:11,840 --> 00:50:17,770 type R CMD SHLIB, R command shared library, 899 00:50:17,770 --> 00:50:20,110 foo.c, or whatever the name of your file is, 900 00:50:20,110 --> 00:50:23,020 at the OS shell, not in the R terminal. 901 00:50:23,020 --> 00:50:25,200 And this will create a library called foo.so. 902 00:50:25,200 --> 00:50:28,180 And then you can load it in an R script or interactively 903 00:50:28,180 --> 00:50:32,310 with the command dyn.load. 904 00:50:32,310 --> 00:50:35,720 And then there's a function in R called .C. 905 00:50:35,720 --> 00:50:39,310 >> This takes arguments that are, first, the name of the function in C 906 00:50:39,310 --> 00:50:40,970 that you want to call. 907 00:50:40,970 --> 00:50:43,920 And then all the parameters to that function, 908 00:50:43,920 --> 00:50:45,420 they have to be in the right order. 909 00:50:45,420 --> 00:50:48,580 You have to use these type coercion functions like as.integer, 910 00:50:48,580 --> 00:50:52,050 as.double, as.character, and as.logical. 911 00:50:52,050 --> 00:50:54,710 And then it returns a list, which again is 912 00:50:54,710 --> 00:50:57,550 an associative array of parameter names and values 913 00:50:57,550 --> 00:51:00,950 after the function has run. 914 00:51:00,950 --> 00:51:08,520 >> So in this case, because dotProd has arguments vec1, vec2, and int n, and out. 915 00:51:08,520 --> 00:51:11,980 In .C we have dotProd, the name of the function 916 00:51:11,980 --> 00:51:16,250 we're calling, vec1, vec2, type coerced.
917 00:51:16,250 --> 00:51:20,060 The length of either vector, I just chose vec1 arbitrarily. 918 00:51:20,060 --> 00:51:25,479 It would be more robust to say as.integer, min, length of vec1, length of vec2. 919 00:51:25,479 --> 00:51:27,520 Then just as.double zero, because we don't really 920 00:51:27,520 --> 00:51:29,644 care what goes in the out parameter, because we're 921 00:51:29,644 --> 00:51:32,270 setting it to zero anyway. 922 00:51:32,270 --> 00:51:37,560 >> And then the result is going to be a big associative array of basically 923 00:51:37,560 --> 00:51:42,090 vec1 is whatever, vec2 is whatever. 924 00:51:42,090 --> 00:51:44,330 But we're interested in out, so we can just pull that out. 925 00:51:44,330 --> 00:51:47,780 This is, again, a very toy example of the foreign function interface. 926 00:51:47,780 --> 00:51:54,160 But if you have to compute dot products of huge vectors in inner loops, 927 00:51:54,160 --> 00:51:56,960 or if you have to do something else in a loop, 928 00:51:56,960 --> 00:51:59,850 and you don't want to rely on R, which does have a bit of overhead 929 00:51:59,850 --> 00:52:02,830 built into it, this can be useful. 930 00:52:02,830 --> 00:52:05,870 >> Again, this is not usually a topic in an introduction to R. 931 00:52:05,870 --> 00:52:08,571 It's not very well documented. 932 00:52:08,571 --> 00:52:11,070 I'm just including it because I've found it useful in the past. 933 00:52:11,070 --> 00:52:13,654 So, bad practices. 934 00:52:13,654 --> 00:52:15,820 I mentioned that there's a for loop in R. 935 00:52:15,820 --> 00:52:21,150 Generally you should, in this language, not use it. 936 00:52:21,150 --> 00:52:26,100 Depending on how R implements iteration internally, it can be slow. 937 00:52:26,100 --> 00:52:28,540 They also just look ugly. 938 00:52:28,540 --> 00:52:32,410 >> R handles vectors very well, so often you don't need to use it.
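The foreign function interface walkthrough above (write a C function that takes only pointers and returns void, build it with `R CMD SHLIB`, load it with `dyn.load`, call it with `.C`) can be sketched end to end as below. This is a hedged sketch rather than the code on the slide: the names `dot_plain`, `dot_prod`, and the temporary file names are stand-ins chosen here, and it assumes a Unix-like system with a C compiler available to R. To keep the example in one file, the C source is written out from R instead of being saved by hand.

```r
# C source following the .C conventions: every argument is a pointer,
# floating-point data is double, and the exported function returns void.
# dot_plain is an ordinary C function; dot_prod is the thin wrapper that
# satisfies the interface, as the talk recommends.
c_source <- '
static double dot_plain(const double *a, const double *b, int n) {
    double total = 0.0;
    int i;
    for (i = 0; i < n; i++) {
        total += a[i] * b[i];
    }
    return total;
}

void dot_prod(double *vec1, double *vec2, int *n, double *out) {
    *out = dot_plain(vec1, vec2, *n);
}
'

# Write the source to a temporary file and build a shared library.
# This runs the same "R CMD SHLIB" command you would type at the OS shell.
src <- file.path(tempdir(), "dot_prod.c")
writeLines(c_source, src)
lib <- file.path(tempdir(), paste0("dot_prod", .Platform$dynlib.ext))
status <- system2(file.path(R.home("bin"), "R"),
                  c("CMD", "SHLIB", "-o", shQuote(lib), shQuote(src)))
stopifnot(status == 0)

dyn.load(lib)

vec1 <- c(1, 2, 3)
vec2 <- c(4, 5, 6)

# Use the shorter length so the C loop never reads past either vector.
n <- min(length(vec1), length(vec2))

# .C returns a list of the arguments as they stand after the call.
# Naming the last argument lets us read it back as result$out.
result <- .C("dot_prod",
             as.double(vec1),
             as.double(vec2),
             as.integer(n),
             out = as.double(0))
print(result$out)

dyn.unload(lib)
```

Keeping the arithmetic in `dot_plain` means the same routine stays usable from other C code, while `dot_prod` only adapts it to the pointer-only calling convention.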
939 00:52:32,410 --> 00:52:35,050 940 00:52:35,050 --> 00:52:38,900 So you can usually replace it, a lot of the time, 941 00:52:38,900 --> 00:52:42,490 with these functions called higher-order functions: Map, Reduce, 942 00:52:42,490 --> 00:52:44,404 Find, or Filter. 943 00:52:44,404 --> 00:52:46,320 I'll give some examples of what these do. 944 00:52:46,320 --> 00:52:49,957 Map is a higher-order function because it takes a function as an argument. 945 00:52:49,957 --> 00:52:52,290 So you can give it a function, you can give it an array, 946 00:52:52,290 --> 00:52:54,640 and it will apply the function to each element of the array 947 00:52:54,640 --> 00:52:55,681 and return a new array. 948 00:52:55,681 --> 00:52:58,035 949 00:52:58,035 --> 00:53:00,160 Reduce, basically you give it an array, you give it 950 00:53:00,160 --> 00:53:02,930 a function that takes two arguments. 951 00:53:02,930 --> 00:53:07,100 It'll apply the function first to the first element and some starter value. 952 00:53:07,100 --> 00:53:09,440 Then to that result and the second. 953 00:53:09,440 --> 00:53:12,590 Then to that result and the third, then to that result and the fourth. 954 00:53:12,590 --> 00:53:14,870 And then it returns when it gets to the end. 955 00:53:14,870 --> 00:53:17,620 So for example, if you wanted to compute the sum of all the elements 956 00:53:17,620 --> 00:53:23,240 in an array, then you could call Reduce with [inaudible] an addition 957 00:53:23,240 --> 00:53:26,620 function, like func a, b, return a plus b. 958 00:53:26,620 --> 00:53:28,960 And then a starting value of 0. 959 00:53:28,960 --> 00:53:32,950 >> All of these, you can find them described in the R documentation, 960 00:53:32,950 --> 00:53:35,720 or in any textbook on functional programming.
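The higher-order functions described above can be tried on a small vector. This is an illustrative sketch; the sample values and the helper functions passed in are made up here, while `Map`, `Reduce`, `Filter`, and `Find` are the base R functions named in the talk.

```r
values <- c(1, 2, 3, 4, 5)

# Map applies a function to each element and returns a list of results.
squares <- Map(function(x) x^2, values)

# Reduce folds a two-argument function across the vector.
# The third argument is the starting value, here 0.
total <- Reduce(function(a, b) a + b, values, 0)

# Filter keeps the elements for which the predicate is TRUE.
evens <- Filter(function(x) x %% 2 == 0, values)

# Find returns the first element that satisfies the predicate.
first_big <- Find(function(x) x > 3, values)

print(unlist(squares))
print(total)
print(evens)
print(first_big)
```

For something as simple as a sum, the built-in vectorized `sum(values)` gives the same answer as the `Reduce` call and is the more usual choice; `Reduce` earns its keep when the combining step is a function you wrote yourself.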
961 00:53:35,720 --> 00:53:38,330 There's also this class of functions called apply functions, 962 00:53:38,330 --> 00:53:42,807 which I don't-- they're a bit hard to explain, 963 00:53:42,807 --> 00:53:45,640 but if you look in the [inaudible] yellow card that I cited at the beginning, 964 00:53:45,640 --> 00:53:48,615 he explains them pretty well in his appendix on R programming. 965 00:53:48,615 --> 00:53:51,599 966 00:53:51,599 --> 00:53:53,390 More on practices: appending to vectors. 967 00:53:53,390 --> 00:53:57,570 968 00:53:57,570 --> 00:53:58,070 Yeah? 969 00:53:58,070 --> 00:54:01,651 970 00:54:01,651 --> 00:54:02,900 I think I need to correct that. 971 00:54:02,900 --> 00:54:07,450 In that first line, vec arrow, that arrow shouldn't be there. 972 00:54:07,450 --> 00:54:10,920 You can assign to a vector, again, by taking its length plus 1 973 00:54:10,920 --> 00:54:13,220 and assigning some value to that. 974 00:54:13,220 --> 00:54:18,970 Which extends the vector, or you can do vec equals c of vec, newvalue. 975 00:54:18,970 --> 00:54:21,540 Again, if you use c with one argument that is a vector, 976 00:54:21,540 --> 00:54:23,300 the resulting hierarchy gets flattened. 977 00:54:23,300 --> 00:54:27,160 So you'll just get a vector that's extended by 1. 978 00:54:27,160 --> 00:54:30,410 Never do this. 979 00:54:30,410 --> 00:54:33,330 >> The reason why you shouldn't do this is this. 980 00:54:33,330 --> 00:54:37,430 When you allocate a vector, it gives it a certain chunk of memory. 981 00:54:37,430 --> 00:54:40,680 If you increase that vector's size, it has to reallocate the vector 982 00:54:40,680 --> 00:54:43,820 somewhere else. 983 00:54:43,820 --> 00:54:46,980 And so reallocation is quite expensive.
984 00:54:46,980 --> 00:54:50,530 I'm not going to go into the details of how memory allocators are implemented 985 00:54:50,530 --> 00:54:57,280 at the operating system level, but it takes a lot of time 986 00:54:57,280 --> 00:54:58,962 to find a new chunk of memory. 987 00:54:58,962 --> 00:55:00,920 And also, if you're reallocating lots and lots 988 00:55:00,920 --> 00:55:03,500 of continually larger chunks, you end up 989 00:55:03,500 --> 00:55:06,420 with something called memory fragmentation, 990 00:55:06,420 --> 00:55:09,390 where the available memory is split up into lots of little blocks 991 00:55:09,390 --> 00:55:11,500 from the memory allocator's point of view. 992 00:55:11,500 --> 00:55:15,340 And it gets harder and harder to find memory for other things. 993 00:55:15,340 --> 00:55:19,455 So instead, if you need to do this, if you need to grow a vector from one element 994 00:55:19,455 --> 00:55:24,240 to the next, instead of appending to it repeatedly, you should preallocate it. 995 00:55:24,240 --> 00:55:29,310 vec arrow, vector, length equals 1,000, or whatever. 996 00:55:29,310 --> 00:55:33,200 >> And then you can just assign to the vector's individual values 997 00:55:33,200 --> 00:55:36,000 later, after you've allocated it once. 998 00:55:36,000 --> 00:55:40,140 I ran into this, again, in my summer job when I was writing a numerical differential 999 00:55:40,140 --> 00:55:42,120 equation solver. 1000 00:55:42,120 --> 00:55:43,180 Not symbolic, numerical. 1001 00:55:43,180 --> 00:55:49,290 The idea is that once you have one value for your solution, 1002 00:55:49,290 --> 00:55:51,240 you use that to compute the next one. 1003 00:55:51,240 --> 00:55:53,700 So my naive instinct was to say OK, 1004 00:55:53,700 --> 00:55:56,930 so I'll start with a vector that's the initial value.
1005 00:55:56,930 --> 00:56:01,260 Compute from that the next value that goes into my solution vector, 1006 00:56:01,260 --> 00:56:02,630 and append that. 1007 00:56:02,630 --> 00:56:05,290 >> Compute another thing, append that. 1008 00:56:05,290 --> 00:56:08,120 It went very, very slowly. 1009 00:56:08,120 --> 00:56:11,540 And once I realized this and I changed my system 1010 00:56:11,540 --> 00:56:16,020 from appending to this vector like 10,000 to 100,000 times, 1011 00:56:16,020 --> 00:56:18,910 to just preallocating a vector and just running with that, 1012 00:56:18,910 --> 00:56:22,100 I got more than a 1,000-fold speedup. 1013 00:56:22,100 --> 00:56:26,280 So this is a very common pitfall in R programming. 1014 00:56:26,280 --> 00:56:31,560 If you need to build a vector piece by piece, preallocate it. 1015 00:56:31,560 --> 00:56:35,360 1016 00:56:35,360 --> 00:56:40,240 >> Another common trip-up-- this is my last slide, don't worry-- is error handling. 1017 00:56:40,240 --> 00:56:42,890 R, to be honest, doesn't really do this very well. 1018 00:56:42,890 --> 00:56:45,010 There are a lot of problems that can occur. 1019 00:56:45,010 --> 00:56:48,360 For example, if you get an array or a vector out of a function 1020 00:56:48,360 --> 00:56:52,377 where you were expecting a single value to come out, or vice versa, 1021 00:56:52,377 --> 00:56:55,460 and pass that into a function that you wrote expecting a single value, 1022 00:56:55,460 --> 00:56:57,270 that can be a problem. 1023 00:56:57,270 --> 00:57:01,440 >> Some functions return NULL if you, say, 1024 00:57:01,440 --> 00:57:05,560 read from a nonexistent key in a list.
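Returning to the preallocation advice above, the two ways of building a solution vector step by step can be compared directly. This is a hypothetical stand-in for the solver from the anecdote, not its actual code: it applies a simple Euler step to the equation dy/dt = -y, and the function names, step size, and step count are all chosen here for illustration.

```r
# Anti-pattern from the talk: start with the initial value and append
# each new value, which forces R to copy the vector on every step.
euler_append <- function(y0, h, steps) {
  y <- y0
  for (i in seq_len(steps)) {
    last <- y[length(y)]
    y <- c(y, last + h * (-last))
  }
  y
}

# Preferred: allocate the full vector once, then assign into its slots.
euler_prealloc <- function(y0, h, steps) {
  y <- vector(mode = "numeric", length = steps + 1)
  y[1] <- y0
  for (i in seq_len(steps)) {
    y[i + 1] <- y[i] + h * (-y[i])
  }
  y
}

slow <- euler_append(1, 0.01, 1000)
fast <- euler_prealloc(1, 0.01, 1000)

# Both versions produce the same values; only the cost differs.
print(isTRUE(all.equal(slow, fast)))

# Timings vary by machine, so no expected numbers are shown here.
print(system.time(euler_append(1, 0.01, 20000)))
print(system.time(euler_prealloc(1, 0.01, 20000)))
```

The gap between the two timings grows with the number of steps, since each append copies everything computed so far.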
1025 00:57:05,560 --> 00:57:08,527 But NULL is not like C, where if you try to read 1026 00:57:08,527 --> 00:57:11,360 from a pointer past, [inaudible] a null pointer, it just seg faults 1027 00:57:11,360 --> 00:57:14,109 and if you're in JavaScript it tells you exactly where it is. 1028 00:57:14,109 --> 00:57:17,080 1029 00:57:17,080 --> 00:57:20,772 Instead, NULL will do-- functions will do unpredictable things 1030 00:57:20,772 --> 00:57:21,730 if you hand them NULL. 1031 00:57:21,730 --> 00:57:24,575 Like if you hand max a NULL, it'll give you negative infinity. 1032 00:57:24,575 --> 00:57:27,230 1033 00:57:27,230 --> 00:57:28,190 And so, yeah. 1034 00:57:28,190 --> 00:57:30,880 1035 00:57:30,880 --> 00:57:32,630 And so this happened to me once when I had 1036 00:57:32,630 --> 00:57:34,771 changed a bunch of fields in my list structure 1037 00:57:34,771 --> 00:57:37,520 at once without changing them elsewhere, where I was reading from them. 1038 00:57:37,520 --> 00:57:40,670 And then I got all sorts of random infinity results popping up 1039 00:57:40,670 --> 00:57:43,080 and I had no idea where they came from. 1040 00:57:43,080 --> 00:57:45,310 And unfortunately, there's no real R strict mode 1041 00:57:45,310 --> 00:57:48,940 where you can say, if anything looks like it could be an error, 1042 00:57:48,940 --> 00:57:51,960 just stop right there so that I can be disciplined and fix it. 1043 00:57:51,960 --> 00:57:55,282 1044 00:57:55,282 --> 00:57:57,240 However, there's something called stopifnot. 1045 00:57:57,240 --> 00:58:00,480 This is similar to C's assert, if you've talked about that. 1046 00:58:00,480 --> 00:58:02,690 I don't think C's assert is a lecture topic, 1047 00:58:02,690 --> 00:58:06,370 but your section leader may have gone over it.
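The NULL pitfall and the `stopifnot` guard mentioned above can be seen in a few lines. This is an illustrative sketch: the list `settings`, its field names, and the function `check_day` are hypothetical, while the behavior of `$` on a missing key, of `max` on `NULL`, and of `stopifnot` is standard base R.

```r
settings <- list(width = 10, height = 20)

# Reading a key that does not exist returns NULL instead of raising an error.
missing_value <- settings$depth
print(is.null(missing_value))

# max(NULL) issues a warning and returns -Inf, which can then leak
# silently into later calculations.
largest <- suppressWarnings(max(missing_value))
print(largest)

# stopifnot() halts as soon as any of its conditions is not TRUE.
# check_day is a made-up example of sanity checking a function input.
check_day <- function(day) {
  stopifnot(is.numeric(day), length(day) == 1, day >= 1, day <= 31)
  day
}

print(check_day(15))

# A bad input fails immediately; try() keeps this script running so the
# failure can be inspected.
bad <- try(check_day(42), silent = TRUE)
print(inherits(bad, "try-error"))
```

Putting the check at the top of the function means the failure surfaces where the bad value enters, not several calls later as an unexplained infinity.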
1048 00:58:06,370 --> 00:58:10,393 And stopifnot basically takes any predicate, so any statement that 1049 00:58:10,393 --> 00:58:11,824 can be true or false. 1050 00:58:11,824 --> 00:58:13,490 And if it's false, it stops the program. 1051 00:58:13,490 --> 00:58:18,260 It tells you exactly what line you were on and what condition failed. 1052 00:58:18,260 --> 00:58:21,910 >> And this is very useful for, for example, sanity checking function inputs. 1053 00:58:21,910 --> 00:58:25,110 So if you have a function and you expect, say, 1054 00:58:25,110 --> 00:58:29,640 if I'm supposed to be given a date, I want the date to be just a vector of length 1 1055 00:58:29,640 --> 00:58:31,735 and somewhere between 1 and 31. 1056 00:58:31,735 --> 00:58:34,420 1057 00:58:34,420 --> 00:58:36,170 And if it's not, I know something's gone wrong. 1058 00:58:36,170 --> 00:58:40,280 And I'd rather stop there before this has random knock-on effects in code 1059 00:58:40,280 --> 00:58:44,190 that's hard to trace through. 1060 00:58:44,190 --> 00:58:47,170 So that's one possible use for stopifnot. 1061 00:58:47,170 --> 00:58:48,660 >> Anyway, OK. 1062 00:58:48,660 --> 00:58:49,690 So that's the end. 1063 00:58:49,690 --> 00:58:51,290 Thank you very much for coming. 1064 00:58:51,290 --> 00:58:53,710 I'm a rank amateur at this. 1065 00:58:53,710 --> 00:58:57,270 So sorry if you're bored or confused or what have you. 1066 00:58:57,270 --> 00:59:01,670 I'm happy to take questions by email at connorharris@college.harvard.edu. 1067 00:59:01,670 --> 00:59:07,230 This also goes for everyone watching this live or later. 1068 00:59:07,230 --> 00:59:10,190 Also, though I'm not a TF, I'm also very 1069 00:59:10,190 --> 00:59:13,900 willing to serve as an unofficial advisor for anyone who is 1070 00:59:13,900 --> 00:59:15,460 using R in a final project.
1071 00:59:15,460 --> 00:59:19,900 >> If you'd like that, then just talk to your TF 1072 00:59:19,900 --> 00:59:23,750 and then write me an email so I know what you're working on 1073 00:59:23,750 --> 00:59:26,680 and so I can set up meeting times with you if you want. 1074 00:59:26,680 --> 00:59:27,990 So again, thank you very much. 1075 00:59:27,990 --> 00:59:28,960 I hope you enjoyed it. 1076 00:59:28,960 --> 00:59:29,450 >> AUDIENCE: [inaudible]. 1077 00:59:29,450 --> 00:59:30,617 >> CONNOR HARRIS: Of course. 1078 00:59:30,617 --> 00:59:34,910 >> AUDIENCE: What kind of project would a CS student use R for? 1079 00:59:34,910 --> 00:59:37,427 1080 00:59:37,427 --> 00:59:40,510 CONNOR HARRIS: So if you're doing something that's purely in data mining, 1081 00:59:40,510 --> 00:59:43,790 for example, there are a lot of things 1082 00:59:43,790 --> 00:59:46,692 you can do with that, with data mining and machine learning. 1083 00:59:46,692 --> 00:59:48,900 You might want to use R for part of something. 1084 00:59:48,900 --> 00:59:52,022 I brought up, earlier, the example of if you're writing a website 1085 00:59:52,022 --> 00:59:54,730 and you want to run automated statistical analyses of your computer 1086 00:59:54,730 --> 00:59:57,990 logs at some time every day, that could be something that's 1087 00:59:57,990 --> 01:00:01,260 very easy to do in just a short R script that you can schedule 1088 01:00:01,260 --> 01:00:04,200 to run every night, for example. 1089 01:00:04,200 --> 01:00:06,550 >> And I'm sure, if there's any reason you'd 1090 01:00:06,550 --> 01:00:11,520 want statistical or graphing capabilities and to have this run automatically instead 1091 01:00:11,520 --> 01:00:13,790 of having to interact with things in Excel, 1092 01:00:13,790 --> 01:00:16,750 for example, that's something you might want to use R for.
1093 01:00:16,750 --> 01:00:21,190 So any more questions before I leave? 1094 01:00:21,190 --> 01:00:21,690 No? 1095 01:00:21,690 --> 01:00:24,960 All right, well, again, thank you very much for coming. 1096 01:00:24,960 --> 01:00:29,417