{"id":2462,"date":"2014-10-19T06:56:11","date_gmt":"2014-10-18T19:56:11","guid":{"rendered":"http:\/\/blog.panicola.com\/?p=2462"},"modified":"2014-10-19T06:56:11","modified_gmt":"2014-10-18T19:56:11","slug":"data-mining-health-records","status":"publish","type":"post","link":"https:\/\/blog.panicola.com\/?p=2462","title":{"rendered":"data mining health records"},"content":{"rendered":"<p>\u201cThe goal in health care is not to protect privacy, the goal is to save lives. We need to have that as starting point,\u201d said David Castro, director of the Center for Data Innovation. \u201cIs the doctor treating me based on the last couple patients he saw, or is he treating me based on the rigorous analysis of millions of patents and finding the 5,000 that are actually just like me, and treating me in a much more accurate way?\u201d<\/p>\n<p>For data mining to succeed would also require recruiting top data scientists to health care, which isn\u2019t easy given the demand in <a href=\"http:\/\/readwrite.com\/2014\/07\/21\/data-scientist-income-skills-jobs\">the hot field<\/a>.<\/p>\n<p>http:\/\/www.washingtonpost.com\/blogs\/innovations\/wp\/2014\/10\/01\/the-incredible-potential-and-dangers-of-data-mining-health-records\/<\/p>\n<div id=\"article-national_1401984660488_727\" class=\"pb-layout-item pb-feature pb-three pb-f-article-article-topper pb-f-theme-normal\" data-pb-async=\"false\" data-pb-columns=\"3\" data-pb-name=\"article-article-topper\" data-pb-content-uri=\"\/pb\/blogs\/innovations\/wp\/2014\/10\/01\/the-incredible-potential-and-dangers-of-data-mining-health-records\/\" data-pb-content-editable=\"\">\n<div id=\"article-topper\" class=\"article-topper\">\n<h1>The incredible potential and dangers of data mining health records<\/h1>\n<\/div>\n<\/div>\n<div class=\"pb-scatter\">\n<div id=\"main-content\" class=\"pb-layout-item pb-chain pb-two main-content\" data-pb-columns=\"2\">\n<div id=\"article-national_1401984690056_321\" class=\" pb-feature pb-chained pb-f-article-article-body pb-f-theme-normal\" data-pb-async=\"false\" data-pb-columns=\"\" data-pb-name=\"article-article-body\" data-pb-content-uri=\"\/pb\/blogs\/innovations\/wp\/2014\/10\/01\/the-incredible-potential-and-dangers-of-data-mining-health-records\/\" data-pb-content-editable=\"\">\n<div id=\"article-body\" class=\"article-body\">\n<div class=\"pb-sig-line hasnt-headshot has-0-headshots hasnt-bio is-not-column\"><span class=\"pb-byline\">By <a href=\"http:\/\/www.washingtonpost.com\/people\/matt-mcfarland\">Matt McFarland<\/a><\/span> <span class=\"pb-timestamp\">October 1<\/span><img decoding=\"async\" class=\"zoom-in\" src=\"http:\/\/www.washingtonpost.com\/wp-apps\/imrs.php?src=http:\/\/img.washingtonpost.com\/blogs\/innovations\/files\/2014\/09\/Copy-of-google-datac2346enter-tech-221363877103_image_982w.jpg&amp;w=1484\" alt=\"\" data-hi-res-src=\"http:\/\/www.washingtonpost.com\/wp-apps\/imrs.php?src=http:\/\/img.washingtonpost.com\/blogs\/innovations\/files\/2014\/09\/Copy-of-google-datac2346enter-tech-221363877103_image_982w.jpg&amp;w=1484\" data-low-res-src=\"http:\/\/www.washingtonpost.com\/wp-apps\/imrs.php?src=http:\/\/img.washingtonpost.com\/blogs\/innovations\/files\/2014\/09\/Copy-of-google-datac2346enter-tech-221363877103_image_982w.jpg&amp;w=480\" \/><\/div>\n<article>\n<div class=\"inline-content inline-photo inline-photo-normal modal-0 horizontal-photo\"><span class=\"pb-caption\">Using the world\u2019s best computers to analyze our health data could revolutionize how we live. (Connie Zhou\/Google)<\/span><\/div>\n<p>Imagine if your doctor could compare your physical health, diet and lifestyle to a thousand Americans with similar characteristics, and realize that you need treatment to prevent heart failure next month.<\/p>\n<p>What if an analysis of your genome could help a physician give you a customized cancer treatment that saves your life?<\/p>\n<p>Unleashing the modern power of computers, data crunching and artificial intelligence could revolutionize health care, improving and extending lives.<\/p>\n<p>It\u2019s the kind of potential Google chief executive Larry Page hinted at when he <a href=\"http:\/\/www.nytimes.com\/2014\/06\/26\/technology\/personaltech\/a-reach-too-far-by-google.html\">told the New York Times<\/a> earlier this year that \u201cwe\u2019d probably save 100,000 lives next year,\u201d if we data mined health care data.<\/p>\n<p>\u201cImagine you had the ability to search people\u2019s medical records in the U.S.,\u201d Page <a href=\"http:\/\/www.khoslaventures.com\/fireside-chat-with-google-co-founders-larry-page-and-sergey-brin\">said in another interview<\/a> this summer. \u201cI imagine that would save 10,000 lives in the first year.\u201d<\/p>\n<p>Page\u2019s numbers sound impressive, but are speculative and unfounded, according to many in the medical industry.<\/p>\n<p>Interviews with more than a dozen\u00a0health care professionals and data scientists found no evidence backing Page\u2019s specific claims. While they universally agree that data mining \u2014 the examination and analysis of huge batches of information\u00a0\u2013 could invigorate health care, they caution that any sort of accurate estimate would be impossible.<\/p>\n<p>\u201cUsually when I see someone put a number on it and throw around saving lives it usually means one, they aren\u2019t usually a clinician or someone who provides care, or No. 2\u00a0it\u2019s someone who really knows better, but is trying to grab a headline,\u201d said Nicholas Marko, the department head of data science at the Geisinger Medical Center.<\/p>\n<p>A Google spokeswoman declined to offer an explanation of Page\u2019s numbers, or make him available for comment.<\/p>\n<p>In one other instance where Page has used an unsubstantiated health care statistic, he\u00a0<a href=\"http:\/\/business.time.com\/2013\/09\/18\/google-extend-human-life\/\">told Time Magazine <\/a>\u00a0last year that solving cancer would only \u201cadd about three years to people\u2019s average life expectancy.\u201d That\u2019s a figure the American Cancer Society and National Cancer Institute had never heard of before. A Google spokeswoman didn\u2019t have an answer when asked for an explanation.<\/p>\n<p>To a cynic, Page is a shrewd businessman twisting facts to shape the national dialogue so that he can profit from absorbing our health data into\u00a0the Google cloud, where his world-class engineers will find ways to make money off all of that information.<\/p>\n<p>An optimist might remember <a href=\"http:\/\/www.businessinsider.com\/larry-page-the-untold-story-2014-4?op=1\">Page\u2019s assertion<\/a> that Google is a company devoted to solving \u201chuge problems for hundreds of millions of people,\u201d and offer him the benefit of the doubt.<\/p>\n<p>\u201cHealth care has been pretty archaic. We\u2019re pretty behind the curve on things,\u201d said Lorren Pettit, a vice president for the Healthcare Information and Management Systems Society, which aims to improve health care through information technology. \u00a0\u201cWe need the innovation of people from outside health care to come in and take\u00a0a look and challenge this industry, and yes with data mining there\u2019s a great world of possibility.\u201d<\/p>\n<p>Shaking up industries is part of Google\u2019s DNA. Its self-driving car project could in theory eliminate the <a href=\"http:\/\/www.washingtonpost.com\/world\/roads-kill-the-toll-of-traffic-accidents-are-rising-in-poor-countries\/2014\/01\/12\/b1065922-56d2-11e3-8304-caf30787c0a9_story.html\">1.24 million<\/a> fatalities a year on global roads. If Page can soften a country\u2019s fears about sharing our health data \u2014 which ends up saving lives \u2014 does the end justifies his means of fuzzy math?<\/p>\n<p>\u201cThere\u2019s tremendous opportunity if we start taking individualized genomic data and health histories and assuming you can perfectly de-identify it, my gosh, if you can mine that and look for patterns between genomic sequences and types of illnesses and effects of treatment on those illnesses you could potentially do a tremendous amount for society and the health of our individuals,\u201d said Christopher Jaeger, Sutter Health\u2019s chief medical information officer.<\/p>\n<p>The average person might spend a few hours a year with their physician, during which data about their health (blood pressure, alcohol consumption, weight, etc.) is written down. If a patient\u2019s health data was tracked 24-7 \u2014 as devices such as smartwatches are making realistic \u2014 there would be an exponential leap\u00a0in the amount of data about someone\u2019s health. More information \u2014 and the comparison of that information to other patients \u2014 should lead to better treatments.<\/p>\n<p>\u201cIt would be great if when the patient walked in our Bluetooth sensors picked up their phone and it pushed in all their exercise and diet history, and then there were analytics that were performed in real time,\u201d said Thomas Graf, chief medical officer at Geisinger Health System. \u201cWhen the doc walked in the room they can say \u2018Oh, looks like you\u2019re exercising at 80 percent of what we were talking about.\u2019 \u201d<\/p>\n<p>But fear of litigation, privacy concerns, regulations and the challenge of collecting and standardizing data all stand in the way of realizing this health care utopia. Still, there are some early examples that hint at what could be done.<\/p>\n<p>Researchers at the University of Bern in Switzerland have built a computer program to better measure the size of brain tumors.<\/p>\n<p>Traditionally radiologists look at MRI scans and measure in two dimensions the size of a tumor. The computer program \u2014 called BraTumIA \u2014 is capable of a 3D analysis of the tumor\u2019s volume, which better measures whether it\u2019s shrinking or growing. Getting measurements right is crucial as physicians determine the best treatment plan for a patient.<\/p>\n<p>\u201cIf I ask two radiologists to do the same job, you will see differences,\u201d said researcher Mauricio Reyes. \u201cThe computer has the ability to be more consistent and more objective over time. Even if you have an error in the computer this error is consistent over time. What really matters is the trend.\u201d<\/p>\n<p>Here\u2019s how the program works.\u00a0A set\u00a0of annotated brain scans \u2014 in which different parts of a tumor are labeled \u2014 are preloaded into the program. The program uses those as a guide to\u00a0teach itself to identify different parts of future brain scans as a tumor or not.<\/p>\n<div class=\"inline-content inline-photo inline-photo-normal modal-1 horizontal-photo\"><a name=\"a805aa5ca6\"><\/a><img decoding=\"async\" class=\"zoom-in\" src=\"http:\/\/www.washingtonpost.com\/wp-apps\/imrs.php?src=http:\/\/img.washingtonpost.com\/blogs\/innovations\/files\/2014\/09\/Un2346236titled.png&amp;w=1484\" alt=\"\" data-hi-res-src=\"http:\/\/www.washingtonpost.com\/wp-apps\/imrs.php?src=http:\/\/img.washingtonpost.com\/blogs\/innovations\/files\/2014\/09\/Un2346236titled.png&amp;w=1484\" data-low-res-src=\"http:\/\/www.washingtonpost.com\/wp-apps\/imrs.php?src=http:\/\/img.washingtonpost.com\/blogs\/innovations\/files\/2014\/09\/Un2346236titled.png&amp;w=480\" \/><br \/>\n<span class=\"pb-caption\">A screenshot of BraTumIA in action. (Mauricio Reyes)<\/span><\/div>\n<p>The end result is being able to run a scan for five minutes on a laptop and having a better understanding of a tumor. If more medical images made their way into databases such as BraTumIA, those services would get even better.<\/p>\n<p>But what if health data we think is anonymous gets identified or hacked? The threat of being sued deters health organizations from sharing data and embracing the full potential of data mining.<\/p>\n<p>For example, MRI exams and CT scans of a patient\u2019s head could\u00a0be used to reconstruct a person\u2019s face. A hacker with access to such a database could use face-detection software to crosscheck the scans with a Web site where users post photos of themselves.<\/p>\n<p>\u201cIf the same person has a Facebook account there\u2019s a good chance that you could identify this person. If I had access to such a database I could give you a list of people in Facebook with names of who has a brain tumor,\u201d cautioned Bjoern Menze, a computer science professor at TU Munchen who researches medical imaging.<\/p>\n<p>\u201cThere will be criminals. There will be people who are bad actors. At some point something is going to get out,\u201d Graf said. \u201cIt\u2019s not an irrational fear. At the same time, people die driving every year and we still choose to drive cars, or most of us do. It\u2019s a risk every person has to decide where they fall on the line.\u201d<\/p>\n<p>Many of those I interviewed anticipated a situation where patients could decide whether to opt into data mining of their health records. A tax benefit might even be given to encourage involvement.<\/p>\n<p>If health records are ever going to be data mined, it\u2019ll happen when consumers are convinced the perks outweigh the costs. The world has already seen dramatic changes to privacy norms as services such as Facebook grow in popularity. It\u2019s incredibly popular Newsfeed \u2014 which funnels the latest information about friends into a feed \u2014 <a href=\"http:\/\/www.washingtonpost.com\/wp-dyn\/content\/article\/2006\/09\/06\/AR2006090601805.html\">was initially met with uproar<\/a> by users concerned about their privacy.<\/p>\n<p>But as users saw the utility of the feed, the tradeoff in privacy became acceptable.<\/p>\n<p>\u201cThe goal in health care is not to protect privacy, the goal is to save lives. We need to have that as starting point,\u201d said David Castro, director of the Center for Data Innovation. \u201cIs the doctor treating me based on the last couple patients he saw, or is he treating me based on the rigorous analysis of millions of patents and finding the 5,000 that are actually just like me, and treating me in a much more accurate way?\u201d<\/p>\n<p>For data mining to succeed would also require recruiting top data scientists to health care, which isn\u2019t easy given the demand in <a href=\"http:\/\/readwrite.com\/2014\/07\/21\/data-scientist-income-skills-jobs\">the hot field<\/a>.<\/p>\n<p>\u201cIt\u2019s hard,\u201d said John Weinstein, chair of bioinformatics and\u00a0computational biology at MD Anderson Cancer Center. \u201cYou really have to battle with Silicon Valley and the Boston academic scene.\u201d<\/p>\n<p>\u201cWhy would someone who is really really good at analyzing data come to work for a health care organization and make X dollars when they could go to Google and make 10X dollars?\u201d Marko added.<\/p>\n<\/article>\n<div class=\"post-body-sig-line\"><a href=\"http:\/\/www.washingtonpost.com\/people\/matt-mcfarland\"><img decoding=\"async\" class=\"post-body-headshot-left\" src=\"http:\/\/img.washingtonpost.com\/wp-apps\/imrs.php?src=http:\/\/wp-eng-static.washingtonpost.com\/author_images\/mcfarlandmh.jpg?ts=1402601608910&amp;h=90&amp;w=90\" alt=\"\" \/><\/a><\/p>\n<div class=\"post-body-bio has-photo\">Matt McFarland is the editor of Innovations. He&#8217;s always looking for the next big thing. You can find him on <a href=\"https:\/\/twitter.com\/mattmcfarland\">Twitter<\/a> and <a href=\"https:\/\/www.facebook.com\/ideasaboutinnovations\">Facebook<\/a>.<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>\u201cThe goal in health care is not to protect privacy, the goal is to save lives. We need to have that as starting point,\u201d said David Castro, director of the Center for Data Innovation. \u201cIs the doctor treating me based on the last couple patients he saw, or is he treating me based on the &hellip; <a href=\"https:\/\/blog.panicola.com\/?p=2462\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">data mining health records<\/span> <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5,22,3],"tags":[],"class_list":["post-2462","post","type-post","status-publish","format-standard","hentry","category-data-saving-lives","category-policy","category-rapid-learning-health-systems"],"_links":{"self":[{"href":"https:\/\/blog.panicola.com\/index.php?rest_route=\/wp\/v2\/posts\/2462","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.panicola.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.panicola.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.panicola.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.panicola.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2462"}],"version-history":[{"count":1,"href":"https:\/\/blog.panicola.com\/index.php?rest_route=\/wp\/v2\/posts\/2462\/revisions"}],"predecessor-version":[{"id":2463,"href":"https:\/\/blog.panicola.com\/index.php?rest_route=\/wp\/v2\/posts\/2462\/revisions\/2463"}],"wp:attachment":[{"href":"https:\/\/blog.panicola.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2462"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.panicola.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2462"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.panicola.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2462"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}