Jul 05 2008

One Month is Enough

Category: IdeasOren Hurvitz @ 8:31 pm
No Photos! (photo by vinnie bezoomny)

No Photos! (photo by vinnie bezoomny)

Dear friends,

The first rule of crisis management is to get ahead of the story. Since my shameful secret is about to be revealed, I decided to break it here first. I’d rather you heard it from me than from the media:

In March 2008 I watched Rick Astley’s music video Never Gonna Give You Up on YouTube. It’s widely considered to be the most corny music video ever created. I have no excuse; I can’t even claim to have been RickRolled. I heard about the video, and willingly went and viewed it. It was me, just me, officer!

The reason for this confession is that Google is about to hand over to Viacom a complete list of every video watched by YouTube users:

[...] the judge granted a Viacom motion that records of every video watched by YouTube users, including their login names and IP addresses, be turned over to the entertainment giant.

The order prevents Viacom from using this information to target lawsuits at users. But it makes no sense to give this information to Viacom in the first place: Google could easily make this data anonymous, and they’ve asked Viacom to do just that. Viacom have said that they won’t use any personally identifiable data, but they haven’t replied to Google’s request directly. These mixed signals make me lunge for my tin foil hat: what could explain Viacom’s behavior? Perhaps, once they have the logs in their possession, they intend to ask the judge to allow them greater use of the data. Or perhaps the data will be “accidentally” leaked — after all, that sort of thing happens all the time.

But criticizing a media company like Viacom for ignoring users’ privacy is like berating a toddler for getting food all over themselves: it’s in their nature, and they’re going to keep doing it. Let’s beat up on Google instead, that never gets old. Google shouldn’t have kept this data around for Viacom to subpoena. Google deletes personally identifiable user data after 18 months, which isn’t enough to hide my Rick Astley obsession. Google’s track record on privacy is spotty in general. For example, after a lot of pressure they finally added a link to their privacy policy on the Google homepage in July 2008, not before bitching and moaning like a teenager whose parents have forced him to clean his room.

Google has some of the most sensitive data in the world; in particular, they know every search that a user makes. In their Privacy FAQ they list several good reasons why they need to keep this data:

  • To improve search results
  • To maintain the security of their systems
  • To prevent fraud and other abuses

It’s true that in order to achieve these goals Google needs to save the search logs. However, the problem isn’t that they keep the search logs; it’s that they keep personally identifiable information in the logs, which lets them (or anyone else, such as Viacom) associate searches and clicks with real people. Google keeps this information for 18 months, and that’s far too long. They could erase the personal information much sooner and still achieve all of the goals described above.

For example, Google use the search logs to find common spelling mistakes made by users, so that they can offer automatic suggestions for the correct spelling. This doesn’t require any personally identifiable information. Another use for the search logs is to detect click fraud. For this purpose it is indeed useful to look at the search and click history of individual users. However, the benefit of this personal data quickly diminishes with time. Data about click fraud that is over a month old should be considered prehistoric; the perpetrators are long gone from whatever IP they had been using.

Private Property (photo by Zervas)

Private Property (photo by Zervas)

Google’s privacy policy doesn’t say how long they keep search logs; probably forever. The only promise they make is to scrub out personally identifiable information after 18 months. Google are very vague about where this figure of “18 months” comes from; perhaps it has some religious significance. From Google’s Privacy FAQ:

Why are logs kept for 18 months before being anonymized?

We strike a reasonable balance between the competing pressures we face, such as the privacy of our users, the security of our systems and the need for innovation. We believe 18 months strikes the right balance.

It’s time we told Google: 18 months is too long. One month would strike the right balance between privacy, security and the need for innovation. With one month of personally identifiable information, Google will be able to catch all the fraud they are ever likely to catch. After that, it’s time to anonymize the data. The anonymized data is still useful for improving their search engine.

Go to Google’s Privacy Feedback page and ask them to reduce the amount of time they keep personally identifiable data in their logs. You could use a message such as this one:

Dear Google,I’m concerned about your data retention policy: you keep user identifiable information in your search logs for 18 months, and that’s too long. As we have seen with the recent lawsuit by Viacom, this information can easily fall into the hands of third parties. To protect my privacy and the privacy of the rest of your users, please reduce the amount of time you keep personally identifiable data to one month. Thank you.

Google isn’t alone in this. Microsoft also anonymizes its logs after 18 months. Yahoo makes do with just 13 months (how did they come up with that number? Perhaps it also holds occult significance). Ask.com, the fourth-largest search provider, gives its users the option of making completely anonymous searches. But we should focus on Google: where the market leader goes, the rest will surely follow.


Jun 17 2008

Apple’s Engineers: an Unexpected Profit Center

Category: IdeasOren Hurvitz @ 10:19 pm

According to salary information collected by new startup Glassdoor, Apple pays its engineers significantly less than competing companies in Silicon Valley. Apple engineers make $89,000 a year, whereas Google engineers can buy four more Segways a year (pre-tax) with their $112,573 paycheck. Microsoft and Yahoo are closer to Google: both companies pay their engineers $105,000 a year. See TechCrunch’s review of Glassdoor for the data.

I wondered how much of a difference this salary disparity made to Apple’s bottom line, so I took a look at its annual 10-K filings from 2003 to 2007. Each of these reports includes, buried among its 170 pages, Apple’s net income and how much it spent on R&D. For simplicity I assumed that the R&D budget was entirely spent on salaries; this isn’t far off the mark in a hi-tech company like Apple.

If Apple were to pay its engineers the same salaries as Google then its R&D budget would increase by 26%. This amount (26% of the R&D budget) is how much Apple saves each year by paying below-market salaries. I calculated what Apple’s net income would have been if it had paid its engineers the same as Google, and these are the results:

Apple\'s Increase in Net Income - Table

Explanation:

  • All dollar values are in millions.
  • # Employees – from Apple’s 10-K.
  • R&D Budget – from Apple’s 10-K.
  • Adjusted R&D Budget – had Apple paid its engineers at the same level as Google, this would have been its R&D Budget.
  • Net Income – from Apple’s 10-K.
  • Adjusted Net Income – had Apple paid its engineers at the same level as Google, this would have been its Net Income.
  • Increase in Net Income – the magnitude by which Apple’s net income was higher that year compared to what it would have been had it paid salaries at the same level as Google.

The Adjusted Net Income is a good estimate, but it’s not completely accurate. For example, the increase in Apple’s R&D Budget would have meant that its expenses are higher, so it would have paid less taxes. But the overall trend is clear.

Here’s the Increase in Net Income in chart form:

Apple\'s Increase in Net Income

In 2003 and 2004, the effect of underpaying its engineers made a huge difference to Apple’s bottom line. In 2003, these savings turned around Apple’s year: from a loss to a small profit. In 2004, they doubled the profit. However, once Apple’s earnings began to skyrocket in 2005, the effect of the R&D savings became much smaller: just 6% of the net income in 2007, for example.

Paying low salaries to its engineers was a lifesaver for Apple during its difficult times. But now that Apple is immensely profitable there’s no more excuse for this practice. In the TechCrunch article mentioned previously, the site’s owner Michael Arrington says: “Apple software engineers make only about $89,000, on average, but they get to create some of the most loved products on Earth.” I’m sure this warms their hearts. But an extra $20,000 a year would make their hearts downright toasty, and their spouses’ as well.


Jun 01 2008

Run for office with Contendr

Category: Fun,IdeasOren Hurvitz @ 11:05 pm

The Obama compaign is hiring developers to create software for his presidential campaign. It was suggested to make this software open-source. But why stop there? Whenever a successful website comes along, someone invariably creates a service that lets anyone churn out a clone in five minutes:

  • Want your own social network? Ning.
  • Your own Digg? coRank.
  • A Wiki to call your own? Wetpaint.
  • Want to show the Twitter folks how to keep a site running? ReVou.
  • Starting a presidential campaign? Contendr!
Marlon Brando in

He coulda been a contender, if only he'd had Contendr. (Marlon Brando in "On the Waterfront")

Suggested features:

  • Collect signatures to get the candidate’s name on the ballot by harvesting .sig’s from Slashdot and other forums.
  • Ask for campaign contributions with a tip jar on the website.
  • Spread the candidate’s message by link-spamming the appropriate sites: Instapundit for Republicans or Daily Kos for Democrats. Actually, link-spam both sites; everyone deserves to hear what you’ve got to say.

The name is available (but sadly, the domain is not). Act now, and help democratize the democratic process!