MediaCloud — A new tool to analyze global media coverage

March 16th, 2009  |  by jz  |  Published in Future of the Internet

The Berkman Center has just launched a very cool new project, MediaCloud, which you can see over at mediacloud.org.  They’re gathering stories from thousands of newspapers, blogs, and other news sources around the web, and then extracting piles of data from the stories: source, topic, entities mentioned, and so on.  Their idea is to figure out how to analyze all that data to answer some longstanding, hard-to-answer questions about media coverage.  A few sample questions that Ethan Zuckerman suggests:  What are the biggest differences between citizen media and mainstream media?  Can we track the path of a news story that starts out poorly covered, but eventually explodes, and figure out what caused the shift?  Is the blogosphere really an echo chamber?

Right now they’ve got a few neat tools hacked together to answer these questions.  You can look at the top ten most covered topics in the news sources of your choice.  You can map a source’s geographical coverage, so you could figure out which source to read if you’re especially interested in, say, Zimbabwe.  You can also look at how terms show up together.  To borrow another example from Ethan, you might want to know which other terms show up in those Zimbabwe stories:

“[T]he BBC most closely associates Zimbabwe with cholera, followed by the United Kingdom, United Nations, United States. Over here, Fox News—Robert Mugabe as the first thing. So, which is an interesting example of sort of playing the man rather than playing the story. Daily Kos, this is sort of interesting: United States, Afghanistan, Iraq, Washington, China, Pakistan. My guess is that Daily Kos almost has no Zimbabwe-dedicated reporting. It’s sort of general commentary on Obama’s foreign policy.”
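
For a rough sense of what that kind of term association involves, here is a minimal Python sketch. It is not MediaCloud's code: the function name `co_occurring_terms`, the (source, terms) input shape, and the sample stories are all made up for illustration. It simply counts which terms appear in the same story as a target term, grouped by source.

```python
from collections import Counter


def co_occurring_terms(stories, target, top_n=3):
    """Return, per source, the terms most often seen alongside `target`.

    `stories` is an iterable of (source, terms) pairs, where `terms` is the
    list of terms extracted from one story. This input shape is an assumption
    made for the example, not MediaCloud's actual data format.
    """
    target = target.lower()
    counts = {}
    for source, terms in stories:
        # Deduplicate within a story so each story counts a term once.
        term_set = {t.lower() for t in terms}
        if target not in term_set:
            continue  # Only stories that mention the target term matter.
        counts.setdefault(source, Counter()).update(term_set - {target})
    return {source: c.most_common(top_n) for source, c in counts.items()}


# Invented sample data, purely illustrative.
stories = [
    ("Source A", ["Zimbabwe", "cholera", "United Nations"]),
    ("Source A", ["Zimbabwe", "cholera"]),
    ("Source B", ["Zimbabwe", "Robert Mugabe"]),
    ("Source B", ["Iraq", "Washington"]),
]

result = co_occurring_terms(stories, "Zimbabwe")
for source, ranked in result.items():
    print(source, ranked)
```

A real analysis would probably also weigh how common each term is across all stories, so that terms which show up everywhere do not dominate every list.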

These tools are a lot of fun to play with, but the long-term goal is to distribute the work of building tools.  The Berkman Center can provide the data, up to 15,000 sources or so, but its researchers can’t think of or implement all the potential creative ways of analyzing it.  So they’re releasing all the data they’ve collected, and they want people to independently take up that data and do something unexpected or interesting with it.  It’s not hard to think of several functions that it would be useful for someone to build (the ability to search by topic rather than by source, and language translation, immediately jump to mind), and the commenters on the site are busily suggesting more.  You can head on over and check it out, suggest some functions you’d like to see, or start building tools yourself.


Blog

  • Dropbox Ran Afoul of Apple’s App Store Review Guidelines: So What?
  • Last week, a number of developers reported that Apple was rejecting iOS applications that used Dropbox, a popular cloud file storage and backup system. An initial thread on the Dropbox developers’ forum has led to an outpouring of tech news full of hyperbolic claims. However, none of this reporting has covered the real problem – Apple is now more concerned about protecting its business model than serving its users or its developers.  Read more »

  • Help pioneer Casebook: The Next Generation
  • We at the H2O project are seeking a full-time Project Manager. H2O is an online platform for textbook development and distribution, currently in a pilot stage. H2O is based on the open source model – instead of locking down materials in formalized textbooks, we believe that course books can be free (as in free speech) for everyone to access and, equally important, build upon.

    Using H2O, professors can freely pull together materials for a course by selecting cases, editing those cases to the sections that are most relevant, and grouping them into readings. Once the materials are assembled, they can be copied in part or in whole by other interested faculty and then edited further.  H2O has been successfully piloted in JZ’s 1L Torts class, and will be rolling out further over the coming year.

    H2O’s project manager will play a leading role in shepherding H2O into its next phase, which will focus on developing new materials and incorporating additional features, in order to expand the platform beyond its law school roots.

    H2O is a joint project of the Berkman Center for Internet & Society and the Harvard Law School library.  The Project Manager will be housed at the HLS Library and work in close collaboration with lead members of the Library Innovation Lab team; he/she will also work closely with the Berkman Center and current H2O teams. More info and job posting here.

  • Meme patrol: “When something online is free, you’re not the customer, you’re the product.”
  • I participated in the Berkman Center’s fascinating HyperPublic symposium in the summer of 2011.  When moderating a panel I invoked the aphorism that “When something online is free, you’re not the customer, you’re the product.”  It’s a way of encapsulating the idea that online free services usually make money by extracting lots of data from users, and then either selling that data or selling advertisers targeted access to those users.  In that sense, the advertisers are the clients, and the users enjoying free content are what’s being sold.  (Of course, sometimes that happens even when the user pays.)

    I didn’t coin the phrase, and since it was featured (and attributed to me!) in wordsmith.org’s wildly popular “word a day” as a thought for the day accompanying the word “enceinte,” I sought to nail down its provenance.

    The first use of the quote that we can find is as a comment within the famed MetaFilter community in August 2010. The user’s name is blue_beetle, who might be someone named Andrew Lewis.  It’s entirely possible I saw it there, as MeFi is one of my five favorite sites on the Web.

    Similar sentiments (whether drawn from that source or independently invented) have been expressed by Bruce Schneier in October 2010 and by Douglas Rushkoff in September 2011.

    The phrase “you’re the product” also apparently appeared in a 1986 speech by President Reagan about the drug war.

    Just say know.

    –KA and JZ

  • OS X Mountain Lion and Gatekeeper
  • This week, Apple announced that it was moving to a new, faster OS X operating system development cycle, starting with the release of Mountain Lion next summer.  It previewed a number of features for the OS, and released some parts in beta.

    Mountain Lion is slated to include a feature called Gatekeeper as part of the security and privacy settings. Gatekeeper allows administrators (those with full privileges on a Mac) to limit the applications that can run on the Mac.  They can choose among allowing apps downloaded from the Mac App Store only, or apps from outside the Store so long as they are digitally signed to Apple’s satisfaction by their developers, or apps from anywhere.  (The last option has been the way both Mac and Windows PCs have worked, for better or worse, since the introduction of the Apple II in 1977.) Read more »

  • GPS-based Insurance Rates: The Devil is in the (Data) Details
  • A British insurance company called Motaquote has teamed up with TomTom, the GPS manufacturer, to offer insurance prices based on data gathered by GPS. Fair Pay Insurance, Motaquote’s new program, is an opt-in insurance pricing scheme where drivers will get a free GPS unit in return for potentially lower (but possibly higher) premiums. The GPS unit will provide all the traditional navigational services as well as warn drivers when they corner too sharply or brake too hard. Read more »

About Jonathan Zittrain

Jonathan Zittrain is Professor of Law at Harvard Law School and co-founder of the Berkman Center for Internet and Society at Harvard Law School.

Creative Commons BY-NC-SA Jonathan Zittrain unless otherwise noted.