Jeremy Howard: The wonderful and terrifying implications of computers that can learn

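Summary

What happens when we teach a computer how to learn? Technologist Jeremy Howard introduces some of the surprising things happening in the rapidly developing field of deep learning: computers that learn Chinese, recognize the objects in photographs, and make medical diagnoses. (One deep learning program, after watching many hours of YouTube video, taught itself the concept of "cats.") Get caught up on the latest developments in this field, because it will change the way the computers around us behave, sooner than you think.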

Script

It used to be that if you wanted to get a computer to do something new, you would have to program it. Now, programming, for those of you here that haven't done it yourself, requires laying out in excruciating detail every single step that you want the computer to do in order to achieve your goal. Now, if you want to do something that you don't know how to do yourself, then this is going to be a great challenge.

So this was the challenge faced by this man, Arthur Samuel. In 1956, he wanted to get this computer to be able to beat him at checkers. How can you write a program, lay out in excruciating detail, how to be better than you at checkers? So he came up with an idea: he had the computer play against itself thousands of times and learn how to play checkers. And indeed it worked, and in fact, by 1962, this computer had beaten the Connecticut state champion.

So Arthur Samuel was the father of machine learning, and I have a great debt to him, because I am a machine learning practitioner. I was the president of Kaggle, a community of over 200,000 machine learning practitioners. Kaggle puts up competitions to try and get them to solve previously unsolved problems, and it's been successful hundreds of times. So from this vantage point, I was able to find out a lot about what machine learning can do in the past, can do today, and what it could do in the future. Perhaps the first big success of machine learning commercially was Google. Google showed that it is possible to find information by using a computer algorithm, and this algorithm is based on machine learning. Since that time, there have been many commercial successes of machine learning. Companies like Amazon and Netflix use machine learning to suggest products that you might like to buy, movies that you might like to watch. Sometimes, it's almost creepy. Companies like LinkedIn and Facebook sometimes will tell you about who your friends might be and you have no idea how it did it, and this is because it's using the power of machine learning. These are algorithms that have learned how to do this from data rather than being programmed by hand.
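
As a minimal sketch of "learned from data rather than programmed by hand", the snippet below suggests products by counting which items were bought together. It is not how Amazon or Netflix actually work; the baskets, the item names, and the functions `learn_cooccurrence` and `recommend` are all made up for illustration.

```python
from collections import Counter
from itertools import permutations

# Hypothetical purchase histories; item names are invented for this sketch.
baskets = [
    {"tent", "sleeping bag", "stove"},
    {"tent", "sleeping bag"},
    {"tent", "stove"},
    {"novel", "bookmark"},
    {"novel", "reading lamp", "bookmark"},
]

def learn_cooccurrence(baskets):
    """Count how often each ordered pair of items appears in the same basket."""
    counts = Counter()
    for basket in baskets:
        for a, b in permutations(sorted(basket), 2):
            counts[(a, b)] += 1
    return counts

def recommend(counts, owned, k=2):
    """Rank items the user does not own by co-occurrence with items they do own."""
    scores = Counter()
    for (a, b), n in counts.items():
        if a in owned and b not in owned:
            scores[b] += n
    return [item for item, _ in scores.most_common(k)]

counts = learn_cooccurrence(baskets)
print(recommend(counts, {"tent"}))
print(recommend(counts, {"novel"}, k=1))
```

No rule such as "people who buy tents want sleeping bags" is written anywhere in the code; the suggestion comes entirely from the counts, so different purchase data would produce different suggestions.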

This is also how IBM was successful in getting Watson to beat the two world champions at "Jeopardy," answering incredibly subtle and complex questions like this one. [ "The ancient 'Lion of Nimrud' went missing from this city's national museum in 2003 (along with a lot of other stuff)" ] This is also why we are now able to see the first self-driving cars. If you want to be able to tell the difference between, say, a tree and a pedestrian, well, that's pretty important. We don't know how to write those programs by hand, but with machine learning, this is now possible. And in fact, this car has driven over a million miles without any accidents on regular roads.

So we now know that computers can learn, and computers can learn to do things that we actually sometimes don't know how to do ourselves, or maybe can do them better than us. One of the most amazing examples I've seen of machine learning happened on a project that I ran at Kaggle where a team run by a guy called Geoffrey Hinton from the University of Toronto won a competition for automatic drug discovery. Now, what was extraordinary here is not just that they beat all of the algorithms developed by Merck or the international academic community, but nobody on the team had any background in chemistry or biology or life sciences, and they did it in two weeks. How did they do this? They used an extraordinary algorithm called deep learning. So important was this that in fact the success was covered in The New York Times in a front page article a few weeks later. This is Geoffrey Hinton here on the left-hand side. Deep learning is an algorithm inspired by how the human brain works, and as a result it's an algorithm which has no theoretical limitations on what it can do. The more data you give it and the more computation time you give it, the better it gets.

The New York Times also showed in this article another extraordinary result of deep learning which I'm going to show you now. It shows that computers can listen and understand.

(Video) Richard Rashid: Now, the last step that I want to be able to take in this process is to actually speak to you in Chinese. Now the key thing there is, we've been able to take a large amount of information from many Chinese speakers and produce a text-to-speech system that takes Chinese text and converts it into Chinese language, and then we've taken an hour or so of my own voice and we've used that to modulate the standard text-to-speech system so that it would sound like me. Again, the result's not perfect. There are in fact quite a few errors. (In Chinese) (Applause) There's much work to be done in this area. (In Chinese) (Applause)

Jeremy Howard: Well, that was at a machine learning conference in China. It's not often, actually, at academic conferences that you do hear spontaneous applause, although of course sometimes at TEDx conferences, feel free. Everything you saw there was happening with deep learning. (Applause) Thank you. The transcription in English was deep learning. The translation to Chinese and the text in the top right, deep learning, and the construction of the voice was deep learning as well.

So deep learning is this extraordinary thing. It's a single algorithm that can seem to do almost anything, and I discovered that a year earlier, it had also learned to see. In this obscure competition from Germany called the German Traffic Sign Recognition Benchmark, deep learning had learned to recognize traffic signs like this one. Not only could it recognize the traffic signs better than any other algorithm, the leaderboard actually showed it was better than people, about twice as good as people. So by 2011, we had the first example of computers that can see better than people. Since that time, a lot has happened. In 2012, Google announced that they had a deep learning algorithm watch YouTube videos and crunched the data on 16,000 computers for a month, and the computer independently learned about concepts such as people and cats just by watching the videos. This is much like the way that humans learn. Humans don't learn by being told what they see, but by learning for themselves what these things are. Also in 2012, Geoffrey Hinton, who we saw earlier, won the very popular ImageNet competition, looking to try to figure out from one and a half million images what they're pictures of. As of 2014, we're now down to a six percent error rate in image recognition. This is better than people, again.

So machines really are doing an extraordinarily good job of this, and it is now being used in industry. For example, Google announced last year that they had mapped every single location in France in two hours, and the way they did it was that they fed street view images into a deep learning algorithm to recognize and read street numbers. Imagine how long it would have taken before: dozens of people, many years. This is also happening in China. Baidu is kind of the Chinese Google, I guess, and what you see here in the top left is an example of a picture that I uploaded to Baidu's deep learning system, and underneath you can see that the system has understood what that picture is and found similar images. The similar images actually have similar backgrounds, similar directions of the faces, even some with their tongue out. This is not clearly looking at the text of a web page. All I uploaded was an image. So we now have computers which really understand what they see and can therefore search databases of hundreds of millions of images in real time.
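
Searching for similar images can be sketched as a nearest-neighbor lookup over feature vectors. The talk does not describe Baidu's system, so everything here is an assumption: the three-number vectors stand in for features that a deep network would produce, the file names are invented, and cosine similarity is just one common choice of measure.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two feature vectors (1.0 means same direction)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def most_similar(query, index, k=2):
    """Return the names of the k indexed images closest to the query vector."""
    ranked = sorted(
        index,
        key=lambda name: cosine_similarity(query, index[name]),
        reverse=True,
    )
    return ranked[:k]

# Hypothetical index: image name -> feature vector (assumed to come from a network).
index = {
    "dog_tongue_out.jpg": [0.9, 0.8, 0.1],
    "dog_in_grass.jpg": [0.8, 0.1, 0.2],
    "city_street.jpg": [0.1, 0.0, 0.9],
}

query = [1.0, 0.7, 0.0]  # features of the uploaded picture (made up)
print(most_similar(query, index))
```

The lookup uses only the vectors, never any text attached to the images, which is the distinction the talk draws with ordinary keyword-based image search.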

So what does it mean now that computers can see? Well, it's not just that computers can see. In fact, deep learning has done more than that. Complex, nuanced sentences like this one are now understandable with deep learning algorithms. As you can see here, this Stanford-based system showing the red dot at the top has figured out that this sentence is expressing negative sentiment. Deep learning now in fact is near human performance at understanding what sentences are about and what it is saying about those things. Also, deep learning has been used to read Chinese, again at about native Chinese speaker level. This algorithm developed out of Switzerland by people, none of whom speak or understand any Chinese. As I say, using deep learning is about the best system in the world for this, even compared to native human understanding.

This is a system that we put together at my company which shows putting all this stuff together. These are pictures which have no text attached, and as I'm typing in here sentences, in real time it's understanding these pictures and figuring out what they're about and finding pictures that are similar to the text that I'm writing. So you can see, it's actually understanding my sentences and actually understanding these pictures. I know that you've seen something like this on Google, where you can type in things and it will show you pictures, but actually what it's doing is it's searching the webpage for the text. This is very different from actually understanding the images. This is something that computers have only been able to do for the first time in the last few months.

So we can see now that computers can not only see but they can also read, and, of course, we've shown that they can understand what they hear. Perhaps not surprising now that I'm going to tell you they can write. Here is some text that I generated using a deep learning algorithm yesterday. And here is some text that an algorithm out of Stanford generated. Each of these sentences was generated by a deep learning algorithm to describe each of those pictures. This algorithm before has never seen a man in a black shirt playing a guitar. It's seen a man before, it's seen black before, it's seen a guitar before, but it has independently generated this novel description of this picture. We're still not quite at human performance here, but we're close. In tests, humans prefer the computer-generated caption one out of four times. Now this system is now only two weeks old, so probably within the next year, the computer algorithm will be well past human performance at the rate things are going. So computers can also write.

So we put all this together and it leads to very exciting opportunities. For example, in medicine, a team in Boston announced that they had discovered dozens of new clinically relevant features of tumors which help doctors make a prognosis of a cancer. Very similarly, in Stanford, a group there announced that, looking at tissues under magnification, they've developed a machine learning-based system which in fact is better than human pathologists at predicting survival rates for cancer sufferers. In both of these cases, not only were the predictions more accurate, but they generated new insightful science. In the radiology case, they were new clinical indicators that humans can understand. In this pathology case, the computer system actually discovered that the cells around the cancer are as important as the cancer cells themselves in making a diagnosis. This is the opposite of what pathologists had been taught for decades. In each of those two cases, they were systems developed by a combination of medical experts and machine learning experts, but as of last year, we're now beyond that too. This is an example of identifying cancerous areas of human tissue under a microscope. The system being shown here can identify those areas more accurately, or about as accurately, as human pathologists, but was built entirely with deep learning using no medical expertise by people who have no background in the field. Similarly, here, this neuron segmentation. We can now segment neurons about as accurately as humans can, but this system was developed with deep learning using people with no previous background in medicine.

So myself, as somebody with no previous background in medicine, I seem to be entirely well qualified to start a new medical company, which I did. I was kind of terrified of doing it, but the theory seemed to suggest that it ought to be possible to do very useful medicine using just these data analytic techniques. And thankfully, the feedback has been fantastic, not just from the media but from the medical community, who have been very supportive. The theory is that we can take the middle part of the medical process and turn that into data analysis as much as possible, leaving doctors to do what they're best at. I want to give you an example. It now takes us about 15 minutes to generate a new medical diagnostic test and I'll show you that in real time now, but I've compressed it down to three minutes by cutting some pieces out. Rather than showing you creating a medical diagnostic test, I'm going to show you a diagnostic test of car images, because that's something we can all understand.

So here we're starting with about 1.5 million car images, and I want to create something that can split them into the angle of the photo that's being taken. So these images are entirely unlabeled, so I have to start from scratch. With our deep learning algorithm, it can automatically identify areas of structure in these images. So the nice thing is that the human and the computer can now work together. So the human, as you can see here, is telling the computer about areas of interest which it wants the computer then to try and use to improve its algorithm. Now, these deep learning systems actually are in 16,000-dimensional space, so you can see here the computer rotating this through that space, trying to find new areas of structure. And when it does so successfully, the human who is driving it can then point out the areas that are interesting. So here, the computer has successfully found areas, for example, angles. So as we go through this process, we're gradually telling the computer more and more about the kinds of structures we're looking for. You can imagine in a diagnostic test this would be a pathologist identifying areas of pathosis, for example, or a radiologist indicating potentially troublesome nodules. And sometimes it can be difficult for the algorithm. In this case, it got kind of confused. The fronts and the backs of the cars are all mixed up. So here we have to be a bit more careful, manually selecting these fronts as opposed to the backs, then telling the computer that this is a type of group that we're interested in.

So we do that for a while, we skip over a little bit, and then we train the machine learning algorithm based on these couple of hundred things, and we hope that it's gotten a lot better. You can see, it's now started to fade some of these pictures out, showing us that it already is recognizing how to understand some of these itself. We can then use this concept of similar images, and using similar images, you can now see, the computer at this point is able to entirely find just the fronts of cars. So at this point, the human can tell the computer, okay, yes, you've done a good job of that.

Sometimes, of course, even at this point it's still difficult to separate out groups. In this case, even after we let the computer try to rotate this for a while, we still find that the left sides and the right sides pictures are all mixed up together. So we can again give the computer some hints, and we say, okay, try and find a projection that separates out the left sides and the right sides as much as possible using this deep learning algorithm. And giving it that hint -- ah, okay, it's been successful. It's managed to find a way of thinking about these objects that's separated out these together.
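
One simple way to picture "find a projection that separates the left sides and the right sides" is to project every point onto the line joining the two group averages. This is a plain difference-of-means projection, not the deep learning method used in the demo, and the three-dimensional vectors and hint values below are invented stand-ins for the 16,000-dimensional features.

```python
def mean(vectors):
    """Component-wise average of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def separating_direction(group_a, group_b):
    """Direction pointing from the average of group_b to the average of group_a."""
    mean_a, mean_b = mean(group_a), mean(group_b)
    return [a - b for a, b in zip(mean_a, mean_b)]

def project(vector, direction):
    """Scalar position of a vector along the chosen direction (dot product)."""
    return sum(x * d for x, d in zip(vector, direction))

# Hypothetical hints from the human: a few examples of each side.
left_hints = [[1.0, 5.0, 0.2], [1.2, 4.0, -0.1]]
right_hints = [[-1.0, 5.5, 0.1], [-0.8, 4.2, 0.0]]

direction = separating_direction(left_hints, right_hints)
# Place the decision threshold halfway between the two projected averages.
threshold = (
    project(mean(left_hints), direction) + project(mean(right_hints), direction)
) / 2

def classify(vector):
    """Label an unseen vector by which side of the threshold it projects to."""
    return "left" if project(vector, direction) > threshold else "right"

print(classify([0.9, 4.8, 0.3]))
print(classify([-1.1, 4.9, 0.0]))
```

Only four hinted examples are used to pick the direction, which mirrors the idea in the demo that a small amount of human guidance can untangle groups the computer had mixed together.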

So you get the idea here. This is a case not where the human is being replaced by a computer, but where they're working together. What we're doing here is we're replacing something that used to take a team of five or six people about seven years and replacing it with something that takes 15 minutes for one person acting alone.

So this process takes about four or five iterations. You can see we now have 62 percent of our 1.5 million images classified correctly. And at this point, we can start to quite quickly grab whole big sections, check through them to make sure that there's no mistakes. Where there are mistakes, we can let the computer know about them. And using this kind of process for each of the different groups, we are now up to an 80 percent success rate in classifying the 1.5 million images. And at this point, it's just a case of finding the small number that aren't classified correctly, and trying to understand why. And using that approach, by 15 minutes we get to 97 percent classification rates.
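
The three success rates quoted above can be turned into the number of images still left to fix at each stage. The total of 1.5 million is the approximate figure from the talk; the function name `remaining_to_fix` is just illustrative.

```python
TOTAL_IMAGES = 1_500_000  # "about 1.5 million car images" in the talk

def remaining_to_fix(total, percent_correct):
    """Images still misclassified at a given accuracy, given as a whole percent."""
    return total * (100 - percent_correct) // 100

for percent in (62, 80, 97):
    print(f"{percent}% correct -> {remaining_to_fix(TOTAL_IMAGES, percent):,} left")
```

Going from 62 percent to 97 percent shrinks the pile the human still has to look at from 570,000 images to 45,000.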

So this kind of technique could allow us to fix a major problem, which is that there's a lack of medical expertise in the world. The World Economic Forum says that there's between a 10x and a 20x shortage of physicians in the developing world, and it would take about 300 years to train enough people to fix that problem. So imagine if we can help enhance their efficiency using these deep learning approaches?

So I'm very excited about the opportunities. I'm also concerned about the problems. The problem here is that every area in blue on this map is somewhere where services are over 80 percent of employment. What are services? These are services. These are also the exact things that computers have just learned how to do. So 80 percent of the world's employment in the developed world is stuff that computers have just learned how to do. What does that mean? Well, it'll be fine. They'll be replaced by other jobs. For example, there will be more jobs for data scientists. Well, not really. It doesn't take data scientists very long to build these things. For example, these four algorithms were all built by the same guy. So if you think, oh, it's all happened before, we've seen the results in the past of when new things come along and they get replaced by new jobs, what are these new jobs going to be? It's very hard for us to estimate this, because human performance grows at this gradual rate, but we now have a system, deep learning, that we know actually grows in capability exponentially. And we're here. So currently, we see the things around us and we say, "Oh, computers are still pretty dumb." Right? But in five years' time, computers will be off this chart. So we need to be starting to think about this capability right now.

We have seen this once before, of course. In the Industrial Revolution, we saw a step change in capability thanks to engines. The thing is, though, that after a while, things flattened out. There was social disruption, but once engines were used to generate power in all the situations, things really settled down. The Machine Learning Revolution is going to be very different from the Industrial Revolution, because the Machine Learning Revolution, it never settles down. The better computers get at intellectual activities, the more they can build better computers to be better at intellectual capabilities, so this is going to be a kind of change that the world has actually never experienced before, so your previous understanding of what's possible is different.

This is already impacting us. In the last 25 years, as capital productivity has increased, labor productivity has been flat, in fact even a little bit down.

So I want us to start having this discussion now. I know that when I often tell people about this situation, people can be quite dismissive. Well, computers can't really think, they don't emote, they don't understand poetry, we don't really understand how they work. So what? Computers right now can do the things that humans spend most of their time being paid to do, so now's the time to start thinking about how we're going to adjust our social structures and economic structures to be aware of this new reality. Thank you. (Applause)

