
Google liberates data

October 11th, 2009  |  by elisabeth  |  Published in cloud, Future of the Internet  |  4 Comments

Professor Zittrain has spent time on this blog and elsewhere discussing the future of cloud computing. One of his frequent suggestions is that it should be easier to move data within the cloud, so we don’t all get locked into a certain photo storage system, or spreadsheet provider, or what have you. It seems that Google agrees. A Google engineering team called the Data Liberation Front—complete with revolutionary logo—recently revealed that it has been working since 2007 to make sure people can move their data out of Google products. Blogger and Gmail are done; Google Sites and Docs are coming next.

As the team explains:

[W]e always encourage people to ask these three questions before starting to use a product that will store their data:

1. Can I get my data out at all?
2. How much is it going to cost to get my data out?
3. How much of my time is it going to take to get my data out?

The ideal answers to these questions are:

1. Yes.
2. Nothing more than I’m already paying.
3. As little as possible.

The Data Liberation Front explains the motivation for doing this in the usual Google manner: Don’t be evil. Don’t trap consumers into products they don’t want to be using.

This is a great step, and a real benefit to the users who want to have choices. Kudos to Google.

One thing to pay attention to, though: the DLF has made it possible to liberate ad-campaign data on AdWords by exporting it in a CSV file. Ben Edelman, a Harvard Business School professor (and sometime Zittrain co-author) who’s been studying online advertising, is skeptical of how well the “liberation” product works for AdWords. He thinks that CSV exports are clumsy, time-consuming, and error-prone, whereas an API-based export could be a powerful tool for advertisers. API-based export is technically possible, but apparently prohibited by Google’s terms and conditions. Without a good way of syncing data across platforms, he argues, advertisers tend to stick solely with Google. In short, he says, “I credit Google’s efforts to facilitate data portability in its ancillary businesses, like document sharing and image hosting. But when it comes to the one business where Google makes billions of dollars—and where Google has 70%+ market share—Google’s actions reveal the company’s willingness to put its own bottom line before advertisers’ interests and, for that matter, fair competition.” Google argues in reply that CSV export is perfectly workable, and that, in fact, many advertisers do use the AdWords Editor to run campaigns for multiple platforms.

The disagreement points out that data portability isn’t an on/off issue—it’s a spectrum, and it bears watching how Google and its competitors fall out along that spectrum.

—By Elisabeth Oppenheimer

Responses

  1. Saqib Ali says:

    October 11th, 2009 at 7:58 pm

    Vendor lock-in is an issue with any data storage system – in the cloud or hosted in-house. We need to look into and investigate the tools that the vendor provides to extract the data out of the system.

    From what I have seen (and experimented with), Google provides an excellent set of APIs to access the data stored in Google’s Cloud. And Google is always working to improve the APIs. Google usually first adds functions to the API, and then introduces them in the UI. Compare this to other software vendors, who usually introduce the new functions in the UI and then at a later time provide API access to those functions – if at all.

    I currently use both Google Docs and Windows Live Workspace to store my personal / school related stuff. I use both of these because they both have their benefits. Windows Live Workspace provides complete integration with Office 2007, whereas Google Docs provides editing capabilities in a Web browser. Recently I have been thinking of writing an application that will synchronize the content of both of these repositories. Google provides APIs that make this task easy from Google’s side, but there are no Windows Live Workspace APIs, so I have to devise a workaround to get documents into the Windows Live Workspace.

    “With problems that we are not aware of yet, the ability to put right – not the sheer good luck of avoiding indefinitely – is our only hope, not just of solving problems, but of making progress.” – Physicist David Deutsch

  2. Andrew Martin says:

    October 12th, 2009 at 6:50 am

    There’s certainly a need for a voluntary code of practice, and perhaps regulation:

    Picture a future scenario where a significant cloud data provider gets into financial difficulty. As rumours about its viability spread, everyone tries to pull off copies of their own data. This overloads the servers, compounding the fears. The company goes bust, and the liquidators do their best to return data to its rightful owners, but they may not have the resources to do so.

    It’s not unlike a run on the bank – and for those with data significantly ‘invested’ in the provider, approaching the same level of seriousness.

  3. Nick @ Brick Marketing says:

    October 13th, 2009 at 8:43 am

    I think this is a battle that will be difficult to fight in the long run. As soon as you start using some of these products, they gain control of whatever you put onto those networks. Facebook has rights to the photos in Facebook. It is the price you pay to use their services. Nobody is forcing anyone to use a Google blog or email.

  4. Kindermode says:

    October 15th, 2009 at 1:30 pm

    I think that it’s dangerous to use these tools and put your own data on networks that everyone can use…

About Jonathan Zittrain

Jonathan Zittrain is Professor of Law at Harvard Law School and co-founder of the Berkman Center for Internet and Society at Harvard Law School.

Creative Commons BY-NC-SA Jonathan Zittrain unless otherwise noted.