Ideas worth spreading

Get the perfect ideas,

selected just for you

TED日本語

TED Talks（英語日本語字幕付き動画）

TED日本語 - ニコラス・クリスタキス: いかに社会的ネットワークが流行を予想するか

TED Talks

いかに社会的ネットワークが流行を予想するか

How social networks predict epidemics

ニコラス・クリスタキス

Nicholas Christakis

内容

人々の複雑な社会的ネットワークを可視化したニコラス・クリスタキスとジェームス・フォウラーは、この情報をもとにして生活の改善をする方法の研究に着手しました。ここではクリスタキスが、革新的なアイデアやリスクのある行動、新型インフルエンザのようなウイルス等の流行は、社会的ネットワークを利用することによって従来より早期に感知できるという最新の研究結果を紹介します。

カテゴリ: 科学と技術

タグ　　: TED日本語

外部リンク: TED｜ニコラス・クリスタキス: いかに社会的ネットワークが流行を予想するか YouTube｜Nicholas Christakis: How social networks predict epidemics

字幕

SCRIPT

Script

For the last 10 years, I've been spending my time trying to figure out how and why human beings assemble themselves into social networks. And the kind of social network I'm talking about is not the recent online variety, but rather, the kind of social networks that human beings have been assembling for hundreds of thousands of years, ever since we emerged from the African savannah. So, I form friendships and co-worker and sibling and relative relationships with other people who in turn have similar relationships with other people. And this spreads on out endlessly into a distance. And you get a network that looks like this. Every dot is a person. Every line between them is a relationship between two people -- different kinds of relationships. And you can get this kind of vast fabric of humanity, in which we're all embedded.

And my colleague, James Fowler and I have been studying for quite sometime what are the mathematical, social, biological and psychological rules that govern how these networks are assembled and what are the similar rules that govern how they operate, how they affect our lives. But recently, we've been wondering whether it might be possible to take advantage of this insight, to actually find ways to improve the world, to do something better, to actually fix things, not just understand things. So one of the first things we thought we would tackle would be how we go about predicting epidemics.

And the current state of the art in predicting an epidemic -- if you're the CDC or some other national body -- is to sit in the middle where you are and collect data from physicians and laboratories in the field that report the prevalence or the incidence of certain conditions. So, so and so patients have been diagnosed with something, or other patients have been diagnosed, and all these data are fed into a central repository, with some delay. And if everything goes smoothly,one to two weeks from now you'll know where the epidemic was today. And actually, about a year or so ago, there was this promulgation of the idea of Google Flu Trends, with respect to the flu, where by looking at people's searching behavior today, we could know where the flu -- what the status of the epidemic was today, what's the prevalence of the epidemic today.

But what I'd like to show you today is a means by which we might get not just rapid warning about an epidemic, but also actually early detection of an epidemic. And, in fact, this idea can be used not just to predict epidemics of germs, but also to predict epidemics of all sorts of kinds. For example, anything that spreads by a form of social contagion could be understood in this way, from abstract ideas on the left like patriotism, or altruism, or religion to practices like dieting behavior, or book purchasing, or drinking, or bicycle-helmet [ and ] other safety practices, or products that people might buy, purchases of electronic goods, anything in which there's kind of an interpersonal spread. A kind of a diffusion of innovation could be understood and predicted by the mechanism I'm going to show you now.

So, as all of you probably know, the classic way of thinking about this is the diffusion-of-innovation, or the adoption curve. So here on the Y-axis, we have the percent of the people affected, and on the X-axis, we have time. And at the very beginning, not too many people are affected, and you get this classic sigmoidal, or S-shaped, curve. And the reason for this shape is that at the very beginning, let's say one or two people are infected, or affected by the thing and then they affect, or infect,two people, who in turn affect four,eight,16 and so forth, and you get the epidemic growth phase of the curve. And eventually, you saturate the population. There are fewer and fewer people who are still available that you might infect, and then you get the plateau of the curve, and you get this classic sigmoidal curve. And this holds for germs, ideas, product adoption, behaviors, and the like. But things don't just diffuse in human populations at random. They actually diffuse through networks. Because, as I said, we live our lives in networks, and these networks have a particular kind of a structure.

Now if you look at a network like this -- this is 105 people. And the lines represent -- the dots are the people, and the lines represent friendship relationships. You might see that people occupy different locations within the network. And there are different kinds of relationships between the people. You could have friendship relationships, sibling relationships, spousal relationships, co-worker relationships, neighbor relationships and the like. And different sorts of things spread across different sorts of ties. For instance, sexually transmitted diseases will spread across sexual ties. Or, for instance, people's smoking behavior might be influenced by their friends. Or their altruistic or their charitable giving behavior might be influenced by their coworkers, or by their neighbors. But not all positions in the network are the same.

So if you look at this, you might immediately grasp that different people have different numbers of connections. Some people have one connection, some have two, some have six, some have 10 connections. And this is called the "degree" of a node, or the number of connections that a node has. But in addition, there's something else. So, if you look at nodes A and B, they both have six connections. But if you can see this image [ of the network ] from a bird's eye view, you can appreciate that there's something very different about nodes A and B. So, let me ask you this -- I can cultivate this intuition by asking a question -- who would you rather be if a deadly germ was spreading through the network, A or B? (Audience: B.) Nicholas Christakis: B, it's obvious. B is located on the edge of the network. Now, who would you rather be if a juicy piece of gossip were spreading through the network? A. And you have an immediate appreciation that A is going to be more likely to get the thing that's spreading and to get it sooner by virtue of their structural location within the network. A, in fact, is more central, and this can be formalized mathematically. So, if we want to track something that was spreading through a network, what we ideally would like to do is to set up sensors on the central individuals within the network, including node A, monitor those people that are right there in the middle of the network, and somehow get an early detection of whatever it is that is spreading through the network.

So if you saw them contract a germ or a piece of information, you would know that, soon enough, everybody was about to contract this germ or this piece of information. And this would be much better than monitoring six randomly chosen people, without reference to the structure of the population. And in fact, if you could do that, what you would see is something like this. On the left-hand panel, again, we have the S-shaped curve of adoption. In the dotted red line, we show what the adoption would be in the random people, and in the left-hand line, shifted to the left, we show what the adoption would be in the central individuals within the network. On the Y-axis is the cumulative instances of contagion, and on the X-axis is the time. And on the right-hand side, we show the same data, but here with daily incidence. And what we show here is -- like, here -- very few people are affected, more and more and more and up to here, and here's the peak of the epidemic. But shifted to the left is what's occurring in the central individuals. And this difference in time between the two is the early detection, the early warning we can get, about an impending epidemic in the human population.

The problem, however, is that mapping human social networks is not always possible. It can be expensive, not feasible, unethical, or, frankly, just not possible to do such a thing. So, how can we figure out who the central people are in a network without actually mapping the network? What we came up with was an idea to exploit an old fact, or a known fact, about social networks, which goes like this: Do you know that your friends have more friends than you do? Your friends have more friends than you do, and this is known as the friendship paradox. Imagine a very popular person in the social network -- like a party host who has hundreds of friends -- and a misanthrope who has just one friend, and you pick someone at random from the population; they were much more likely to know the party host. And if they nominate the party host as their friend, that party host has a hundred friends, therefore, has more friends than they do. And this, in essence, is what's known as the friendship paradox. The friends of randomly chosen people have higher degree, and are more central than the random people themselves.

And you can get an intuitive appreciation for this if you imagine just the people at the perimeter of the network. If you pick this person, the only friend they have to nominate is this person, who, by construction, must have at least two and typically more friends. And that happens at every peripheral node. And in fact, it happens throughout the network as you move in, everyone you pick, when they nominate a random -- when a random person nominates a friend of theirs, you move closer to the center of the network. So, we thought we would exploit this idea in order to study whether we could predict phenomena within networks. Because now, with this idea we can take a random sample of people, have them nominate their friends, those friends would be more central, and we could do this without having to map the network.

And we tested this idea with an outbreak of H1N1 flu at Harvard College in the fall and winter of 2009, just a few months ago. We took 1,300 randomly selected undergraduates, we had them nominate their friends, and we followed both the random students and their friends daily in time to see whether or not they had the flu epidemic. And we did this passively by looking at whether or not they'd gone to university health services. And also, we had them [ actively ] email us a couple of times a week. Exactly what we predicted happened. So the random group is in the red line. The epidemic in the friends group has shifted to the left, over here. And the difference in the two is 16 days. By monitoring the friends group, we could get 16 days advance warning of an impending epidemic in this human population.

Now, in addition to that, if you were an analyst who was trying to study an epidemic or to predict the adoption of a product, for example, what you could do is you could pick a random sample of the population, also have them nominate their friends and follow the friends and follow both the randoms and the friends. Among the friends, the first evidence you saw of a blip above zero in adoption of the innovation, for example, would be evidence of an impending epidemic. Or you could see the first time the two curves diverged, as shown on the left. When did the randoms -- when did the friends take off and leave the randoms, and [ when did ] their curve start shifting? And that, as indicated by the white line, occurred 46 days before the peak of the epidemic. So this would be a technique whereby we could get more than a month-and-a-half warning about a flu epidemic in a particular population.

I should say that how far advanced a notice one might get about something depends on a host of factors. It could depend on the nature of the pathogen -- different pathogens, using this technique, you'd get different warning -- or other phenomena that are spreading, or frankly, on the structure of the human network. Now in our case, although it wasn't necessary, we could also actually map the network of the students.

So, this is a map of 714 students and their friendship ties. And in a minute now, I'm going to put this map into motion. We're going to take daily cuts through the network for 120 days. The red dots are going to be cases of the flu, and the yellow dots are going to be friends of the people with the flu. And the size of the dots is going to be proportional to how many of their friends have the flu. So bigger dots mean more of your friends have the flu. And if you look at this image -- here we are now in September the 13th -- you're going to see a few cases light up. You're going to see kind of blooming of the flu in the middle. Here we are on October the 19th. The slope of the epidemic curve is approaching now, in November. Bang, bang, bang, bang, bang -- you're going to see lots of blooming in the middle, and then you're going to see a sort of leveling off, fewer and fewer cases towards the end of December. And this type of a visualization can show that epidemics like this take root and affect central individuals first, before they affect others.

Now, as I've been suggesting, this method is not restricted to germs, but actually to anything that spreads in populations. Information spreads in populations, norms can spread in populations, behaviors can spread in populations. And by behaviors, I can mean things like criminal behavior, or voting behavior, or health care behavior, like smoking, or vaccination, or product adoption, or other kinds of behaviors that relate to interpersonal influence. If I'm likely to do something that affects others around me, this technique can get early warning or early detection about the adoption within the population. The key thing is that for it to work, there has to be interpersonal influence. It can not be because of some broadcast mechanism affecting everyone uniformly.

Now the same insights can also be exploited -- with respect to networks -- can also be exploited in other ways, for example, in the use of targeting specific people for interventions. So, for example, most of you are probably familiar with the notion of herd immunity. So, if we have a population of a thousand people, and we want to make the population immune to a pathogen, we don't have to immunize every single person. If we immunize 960 of them, it's as if we had immunized a hundred [ percent ] of them. Because even if one or two of the non-immune people gets infected, there's no one for them to infect. They are surrounded by immunized people. So 96 percent is as good as 100 percent. Well, some other scientists have estimated what would happen if you took a 30 percent random sample of these 1000 people,300 people and immunized them. Would you get any population-level immunity? And the answer is no. But if you took this 30 percent, these 300 people and had them nominate their friends and took the same number of vaccine doses and vaccinated the friends of the 300 -- the 300 friends -- you can get the same level of herd immunity as if you had vaccinated 96 percent of the population at a much greater efficiency, with a strict budget constraint.

And similar ideas can be used, for instance, to target distribution of things like bed nets in the developing world. If we could understand the structure of networks in villages, we could target to whom to give the interventions to foster these kinds of spreads. Or, frankly, for advertising with all kinds of products. If we could understand how to target, it could affect the efficiency of what we're trying to achieve. And in fact, we can use data from all kinds of sources nowadays [ to do this ] .

This is a map of eight million phone users in a European country. Every dot is a person, and every line represents a volume of calls between the people. And we can use such data, that's being passively obtained, to map these whole countries and understand who is located where within the network. Without actually having to query them at all, we can get this kind of a structural insight. And other sources of information, as you're no doubt aware are available about such features, from email interactions, online interactions, online social networks and so forth. And in fact, we are in the era of what I would call "massive-passive" data collection efforts. They're all kinds of ways we can use massively collected data to create sensor networks to follow the population, understand what's happening in the population, and intervene in the population for the better. Because these new technologies tell us not just who is talking to whom, but where everyone is, and what they're thinking based on what they're uploading on the Internet, and what they're buying based on their purchases. And all this administrative data can be pulled together and processed to understand human behavior in a way we never could before.

So, for example, we could use truckers' purchases of fuel. So the truckers are just going about their business, and they're buying fuel. And we see a blip up in the truckers' purchases of fuel, and we know that a recession is about to end. Or we can monitor the velocity with which people are moving with their phones on a highway, and the phone company can see, as the velocity is slowing down, that there's a traffic jam. And they can feed that information back to their subscribers, but only to their subscribers on the same highway located behind the traffic jam! Or we can monitor doctors prescribing behaviors, passively, and see how the diffusion of innovation with pharmaceuticals occurs within [ networks of ] doctors. Or again, we can monitor purchasing behavior in people and watch how these types of phenomena can diffuse within human populations.

And there are three ways, I think, that these massive-passive data can be used. One is fully passive, like I just described -- as in, for instance, the trucker example, where we don't actually intervene in the population in any way. One is quasi-active, like the flu example I gave, where we get some people to nominate their friends and then passively monitor their friends -- do they have the flu, or not? -- and then get warning. Or another example would be, if you're a phone company, you figure out who's central in the network and you ask those people, "Look, will you just text us your fever every day? Just text us your temperature." And collect vast amounts of information about people's temperature, but from centrally located individuals. And be able, on a large scale, to monitor an impending epidemic with very minimal input from people. Or, finally, it can be more fully active -- as I know subsequent speakers will also talk about today -- where people might globally participate in wikis, or photographing, or monitoring elections, and upload information in a way that allows us to pool information in order to understand social processes and social phenomena.

In fact, the availability of these data, I think, heralds a kind of new era of what I and others would like to call "computational social science." It's sort of like when Galileo invented -- or, didn't invent -- came to use a telescope and could see the heavens in a new way, or Leeuwenhoek became aware of the microscope -- or actually invented -- and could see biology in a new way. But now we have access to these kinds of data that allow us to understand social processes and social phenomena in an entirely new way that was never before possible. And with this science, we can understand how exactly the whole comes to be greater than the sum of its parts. And actually, we can use these insights to improve society and improve human well-being.

Thank you.

私はこの10年間人はどのようにそしてなぜ社会的ネットワークを形成するのか解明しようと努力してきましたここで言う社会的ネットワークとは最近のインターネット上のものでなくどちらかというとアフリカのサバンナに出現して以来何十万年もの間人類が築いてきた社会的つながりですつまり私が友人関係や同僚関係そして兄弟関係や親類関係を持ちその人達が似た関係を他の人達と持ちこれが果てしなくずっと広がっていってこのようなネットワークができますそれぞれの点は人で間の線は二人が関係していることを表しますいろいろな人間関係ですこのような広大な人間社会の構造ができ私達は皆その一部となっています

私は同僚のジェームスフォウラーとかなり以前からどのような数学的社会的　生物学的そして心理学的な法則がこれらのネットワークの構築を左右するのかまたどんな法則がどうネットワークを動かし人々の生活に影響するのかについて研究してきましたそして最近は解明するだけでなくその洞察を利用して実際に世の中を改善する方法を見つけもっと役立つことをして何かを解決したりできないかと考えていますそこでまず取り組もうと思ったのが疫病の流行を予想することでした

疫病対策センターやその他の国家機関での感染症流行の予測技術の現状は現場の医師や研究所が報告する特定の疾患の有病率や発生率のデータを機関の拠点から収集するというものです患者の誰々さんが何かの病気だと診断された他にも発症した患者がいたこうしたデータが情報センターにいくらか遅れて入るわけです滞りなくすべて進めば今日どこで疫病が流行っていたか 1～2週間後に分かるのです実のところ 1年ほど前に「インフルトレンド」というグーグルのツールが広まりました人々の現在の検索パターンを見てインフルエンザの発生地域現在の流行状況や有病率が把握できるのです

でも今日皆さんにお見せしたいのは伝染病の発生を迅速に警告するだけでなく実際にその流行を早期に感知できるかもしれないひとつの方法です事実このアイデアは細菌による感染症を予測するだけでなく様々なタイプの流行の予想に応用できます例えば社会的感染という形で広まるものはすべてこうして理解できます図の左に示した愛国心や利他主義や宗教のような抽象的な概念から食生活や書籍購入そして飲酒などの習慣自転車ヘルメット着用などの安全習慣や売れる商品電子機器の購入などまで人を通して広がるものすべてです新しいアイデアの普及なども今からご覧いただく方法によって理解し予測することが可能です

おそらく皆さんご存知だと思いますが普及を表すには従来イノベーション普及率という採用曲線を使用します Y軸は何％の人が影響されているかそしてX軸は時間を表します最初の時点ではあまり多くの人が影響されておらず典型的なS字型カーブのグラフになりますなぜこのような形になるのかと言うと一番初めに1人か2人が影響または感染されているとするとその2人が次の2人を感染させ次に感染されるのは4人そして8人 16人と増え流行の増殖期のカーブを形成するからです最終的には人口のほとんどが感染されまだ感染されていない人がどんどん少なくなりカーブは頭打ちとなりますそして典型的なS字型カーブとなるのですこれは病原菌やアイデア製品普及や習慣のようなものでも同じですでも物事は人々の間でランダムに普及しません普及はネットワークを通して行なわれます私達は皆ネットワークの中で生きているからですそしてこれらのネットワークには特定の構造があります

こちらのネットワークを見てください 105人います点は人を表し線は友人関係を表します人によってネットワーク内の位置が違うことが分かると思いますまた人間関係も多様です友人関係兄弟関係夫婦関係同僚関係隣人関係などいろいろありますそして関係によって違うものが広がります例えば性感染症は性的つながりをもって広がります喫煙習慣は友人関係に影響されるかもしれません利他的または慈善的行為だと同僚に感化されてかもしれませんし隣人の影響かもしれませんでもネットワーク内の位置のすべてが平等というわけではありません

これを見てもらえばすぐ分かりますがつながりの数は人によって違います 1つの人もいれば2つの人もいて 6つの人もいれば10個の人もいますこれはノードの度数とも言われ節点の持つつながりの数ですしかしそれだけではありません節点AとBを見てもらうと両者とも6つのつながりを持っていますでもこの図を全体的に見ると節点AとBには大きな違いがあると気づくと思いますこう考えたら分かりやすいと思いますもし致死的な病原菌がネットワーク内で広まっていたらAとBのどちらになりたいですか？（聴衆：B）クリスタキス：もちろんBですね Bはネットワークの端に位置していますでは気になる噂話がネットワーク内で流れていたらどちらになりたいですか？ Aですね　一見して Aの方がいち早く広まる噂を耳にする可能性が高いと分かりますこれはネットワーク構造上の位置のおかげです実際にAは中心寄りに位置しておりこれは数式で表すことができますですからネットワークを通じて広がっている何かを追跡したい場合節点Aも含んだネットワークの中心部の人々にセンサーをつけその人々を観察することによってネットワークを介して広がっている何かを早期発見するのが理想です

この人々が病気に感染したり情報を得たら近いうちに全員にこの病原菌または情報が伝わるだろうと分かるのですこの方法は集団の構造を踏まえずにランダムに選出した 6人を観察するよりずっと効果的です実際中心部の人々を観察できればこのような結果が見られる筈です左の図には前に見たS字型の採用曲線があります赤の点線はランダムに選出された人々の間での普及です左側の左にずれている線はネットワーク中心部の人々の間での普及を表します Y軸は感染者の累積人数です X軸は時間です右にあるのは同じデータですが 1日ごとの発症件数ですここにご覧いただけるのはたった数人の感染者からどんどん増えてここで流行のピークとなることです左にずれたグラフが中心部の人々の状態ですそしてこの2つの間の時間差が兆しとなりこの人々の間で流行が起こる早期警告となるのです

しかし問題は社会的ネットワークを図にするのがいつも可能なわけでないことですコストが高すぎたり実施が難しかったり倫理的でなかったりただ単にそんなことは不可能な場合もありますでは実際にネットワークを図にしないでどのように中心にいるのは誰かを調べることができるのでしょうか？我々が思いついたのは社会的ネットワークについて前から知られている現象を利用することでしたこのような現象ですあなたの友人にはあなたよりたくさん友人がいると知っていましたか？あなたの友人にはあなたより友人がいるのです「友人関係のパラドックス」と言われています社会的ネットワークの中でとても人気があり友人が多いパーティのホストと友人は1人だけの人間嫌いがいるとしますここからランダムに選ばれた人はパーティのホストを知っている確率の方が高いのです彼らがパーティのホストを友人として挙げたらパーティのホストには大勢の友人がいるので彼らよりも友人が多いということになります基本的にこれが「友人関係のパラドックス」というものですランダムに選ばれた人達よりその友人達の方がより多くのつながりを持ち中心寄りの位置にいるのです

ネットワークの端の方にいる人々に注目するとこのことが自然に理解できると思いますこの人を見ると友人として挙げられるのはこの人しかいませんそしてこの人にはネットワークの構造上最低2人通常はそれ以上の友人がいることになります端の節点のどれをとってもこの現象は見られ実際ネットワークの中心に向かって全体的に見られます誰を選出してもですランダムに選出された人が友人を挙げるとネットワークの中心に近づくわけですそこで我々はこのアイデアを利用してネットワーク内の現象を予測できるか研究しようと考えましたこのアイデアをもとにすればネットワークの図がなくても集団からランダムに誰かを選び友人を挙げてもらって中央寄りの人の選出ができるからです

我々はハーバード大学での新型インフルエンザの発生でこれを検証しましたつい2～3ヶ月前の2009年秋から冬でしたランダムに選出した学部生1300人に友人を挙げてもらいそのランダムの学生と友人の両方を毎日追跡調査して流行のインフルエンザへの感染を調べました大学内診療所の利用監視と週に数回のメール報告での調査ですすると我々が予想した通りのことが起こりました赤い線がランダムのグループです友人グループの中での流行は左のこちらへ寄っています 2つのグループの違いは16日です友人グループを追跡することによってこの集団における感染流行を 16日前に警告できるわけです

またそれだけでなくアナリストが流行の研究や新製品の普及の予測をしようするときに集団からランダムに選んだサンプルとさらに挙げてもらった友人の両方のグループを追跡することができます例えばその友人グループでイノベーション普及に急上昇があれば流行の兆しとなりますまた左にあるように2つの線が分岐し始めるのもサインです友人グループの線が急上昇しランダムサンプルのグループに差をつけて開き始めたのはどの時点か？それはこの白い線が示す時点で流行のピークの 46日前でしたつまりこの方法を使えば一定の集団の中で起こるインフルエンザの流行を1ヵ月半以上前に察知できるのです

どのくらい前の時点でそのような兆しが見られるかは様々な要素により異なると思います病原体の特性によることもあり得ますこの方法で違う種類の病原体を見た場合異なる兆候が出ると思います他の広がっている現象でもそうです人のネットワークの構造が違うからということもあります我々の実例では必要ではなかったのですが実際に学生のネットワークを図にすることが出来ました

これが714人の学生と彼らの友人のつながりを示した図ですこれからこの図の移り変わりを見せますネットワークの日々の変化を 120日分見てみましょう赤い点がインフルエンザの感染を示しますそして黄色い点がインフルエンザ感染者の友人です点の大きさはインフルエンザに感染している友人の数に応じて大きくなりますつまり大きい点はインフルエンザに感染した友人が多い人ですこの図を見てください　9月13日の状態ですいくつか色のついた点がみられますインフルエンザが中心でポツポツ見られます今10月19日の状態です 11月になると流行のカーブが立ち上がりパッパッと中心部で次々に感染が広がりますそしてだんだん頭打ちになっていきます 12月末に近づくにつれて感染がどんどん少なくなりますこのような可視化によってこういった流行はまず中央部の人間から感染して他の人々に感染することが明らかになります

それで今まで申し上げてきたようにこの方法は細菌だけでなく人々の間で伝染するもの何にでも使えます情報は人々を通じて広がります常識も人から人へと広がります言動も人々の間で広がります言動というのは犯罪行為や選挙投票もあれば健康管理行為で喫煙や予防接種のようなこともあり製品普及やその他の行動で人間同士が影響し合うものもあります言動によって回りの人間が影響される傾向があったらこの方法によりその集団における流行の発生や兆しを早期に知り得ることができるわけですこの方法が成り立つポイントは人間同士の影響があることです一斉に実施され全員が同じように影響されるような仕組みでは駄目です

さてこの同じ洞察を違うやり方でネットワークに関連するものに対して活用することもできます介入目的のために特定の人々を対象として選ぶのに利用するのが一例です例えば皆さん集団免疫についてはたぶん知っていると思いますが 1000人のグループがいたとしてこのグループをある病原体から守りたい場合全員に予防接種する必要はありませんこのうち960人に免疫ができれば 1000人に予防接種したのと同じになりますたとえ1人か2人の免疫のない人が感染してもその人達が病気をうつす相手がいないからです免疫のある人ばかりに囲まれているわけですこのように96%は100%と同じくらい効果的です 1000人の中から30%をランダムで選出し予防接種をしたらどうなるか計算した科学者達がいましたが集団レベルでの免疫が得られるかと言うと得られませんでもこの同じ30%の300人に友人を挙げてもらって同じ数の予防接種を 300人が挙げた友人達300人に実施すると集団免疫と同等の免疫ができます集団の96%に予防接種したのと同じ効果を厳しい予算でもずっと効率よく得られるのです

似たようなアイデアを使って発展途上国で蚊帳などを配布する際に対象者を限定することもできます村のネットワークの構造が分かっていれば蚊帳などの普及を促進する中心部の人々をターゲットにして介入援助できますまた率直に言ってこれはどんな商品の宣伝にも使えます対象者の選定の仕方が分かれば目的を達成する効率を上げることができます事実現在ありとあらゆるところで集められているデータを利用できます

こちらはヨーロッパにおける 800万人の電話利用者の関係図ですそれぞれの点は人を表し線はその人達の間の電話回数を表します私達はこのような自動的に集められたデータによってこれらの国の全体像を見たりネットワークのどこに誰がいるか理解できます特別なデータ処理などしなくてもこのような構造の洞察を得ることができるのですお気づきと思いますがこのようなデータは他の情報源からも手にすることができますメールやインターネット上のやりとりソーシャルネットワークなどです実際今の時代は大量のデータが自動的に蓄積されています大量に収集されたデータの使い道は幾通りもあります集団を追跡するためのセンサーとなる中心部の人々を特定したりその集団の中で何が起こっているか理解したり改善の為に介入したりできます最近の技術では誰と誰がしゃべっているかだけでなく人々がどこにいるのかも分かるからですアップロードされるものから人々が考えていることが分かり購入記録から商品の売れ筋も分かりますこれらすべての管理データを合わせて処理すれば人々の行動を以前はできなかった方法で理解できます

トラックの運転手による燃料購入を例にします運転手達は普段通りに仕事をして燃料を購入します私たちは燃料の購入量が急上昇するのを見て経済低迷期の終わりが近いと分かりますまたは人々が高速道路を移動している速度を携帯電話で計測することもできます電話会社は速度が落ちるのを見て渋滞を感知できます更にその情報を携帯電話ユーザーに提供できるわけですそれも同じ高速道路上でその渋滞の後続のユーザーに限定できます医師の薬品処方状況を観察することもできます新規の医薬品がどのように医師の間で普及するのか理解することができます人々の商品購入状況の観察をしてこのようなタイプの現象がどうやって人々の間で普及するのか確認することができます

自動蓄積された大量データの利用法は 3つあると思います 1つ目は完全に受身的な先ほど説明したようなものですトラックの運転手の例のような実際には集団に一切介入しないしないものですそして半能動的な例に挙げたインフルエンザのような人々に友人を挙げてもらい彼らがインフルエンザに感染しないか観察して警告を受けるものもあります別の例として電話会社がネットワークの中心に位置する人を調べて「毎日熱を測って携帯メールで送ってもらえますか？」「体温だけでかまいません」と頼み大量の体温データを中心部の人々に限定して収集することも考えられますこうして人々の最低限の情報提供だけで伝染病の流行の兆しを広範囲に監視できるのですまたはもっと積極的なアプローチもできますこのあとの講演者も話しますが人々が世界中からウィキに参加したり写真や選挙の追跡をしたりして情報をアップロードしたものを社会的プロセスや現象を理解するために収集することもできます

事実これらのデータが入手できるのは専門家が言うところの「計算社会科学」のような一種の新たな時代の到来を告げていますこれはガリレオが望遠鏡を使ってそれまでにないやり方で天空の観察ができたことやレーウェンフクが顕微鏡を発明し生物学に新たな見解をもたらしたことに似ています今度は大量データが入手できるようになり社会的プロセスや現象を以前にはなかったやり方で理解することができるようになったわけですそしてこの科学により私達は社会全体が具体的にどうやってただ一人ひとりを足しただけよりも偉大となるのか理解することができるのですそして実際にこれらの洞察を利用して社会および人々の生活を改善できるのです

ありがとうございました

―　もっと見る　―

―　折りたたむ　―

品詞分類

主語
動詞
助動詞
準動詞
関係詞等

品詞分類表

TED 日本語

TED Talks

関連動画

洋楽おすすめ

RECOMMENDS

洋楽歌詞