Archive

Posts Tagged ‘wtf’

Ball Drop: Dell’s Faulty Product Page

February 20th, 2010

As a number of perturbed status updates I’d posted to my Facebook profile in the wee hours of Friday morning suggested to my friends this AM, the health of my Mac Mini, Cylon.local, took a bit of a nose dive last night. Now, it’s probably just a hard drive failure, which is actually not so bad1, but I won’t know for sure until I take the little fella down to Tekserve‘s “ER” this weekend and get it properly diagnosed.

So one of the thoughts that naturally occurred to me is that there’s at least some small chance that Cylon.local won’t be coming back; perhaps the resurrection ship was simply too far away when the dreadful moment arrived.

I’d just bought a Mac Mini for my parents this past Christmas, so I already know the value proposition of replacing it with the latest model.

But, while I’m entertaining the notion of replacement hardware, it occurs to me that Dell rolled out a competitor a few months ago, called the Inspiron Zino HD. Now don’t get me wrong: I’m quite happy with the Mini’s performance over the last four years, and I’d be happy to keep it for as long as it’ll stick around with me, but any sensible man would think to check in on his options. Read more…

Footnotes

  1. Thanks to the trusty Time Capsule I managed to snag with a friend’s NYC teacher’s discount — thanks, Jenny!

Business Sense , , , , , , , ,

Climategate: a Case Study in How Not to Conduct Research

November 25th, 2009

Sometimes events arrive with a timing that is both serendipitous and uncanny. Only days after my last post, wherein I state a case for the growing importance of referencing the datasets and algorithms used in the distillation of research conclusions, comes a story about leaked correspondence records (email messages) amongst climate researchers working in affiliation with the East Anglia Climate Research Unit, or CRU.

From the NYT article:

The e-mail messages, attributed to prominent American and British climate researchers, include discussions of scientific data and whether it should be released, exchanges about how best to combat the arguments of skeptics…. Drafts of scientific papers … were also among the hacked data, some of which dates back 13 years.

To say the least, the leak contains some juicy fodder for skeptics of human-driven climate change amongst the leaked materials.

Amongst these leaked emails, for example, are conversations which document various difficulties some of the CRU’s climate researchers have encountered over the years in trying to work with the data collected and managed by the organization. The Times article focuses on a discussion thread in which researcher Phil Jones mentions using a “trick” — originally employed by another colleague, Michael Mann — to “hide [a] decline” in temperatures apparently shown in some set of data.

In an interview about the leaked emails, Dr. Mann attempts to defuse the statement as a poor choice of words. Unfortunately, whether he’s being sincere or not, his is frankly a response that’s to be expected.

The article continues:

Some skeptics asserted Friday that the correspondence revealed an effort to withhold scientific information. “This is not a smoking gun; this is a mushroom cloud,” said Patrick J. Michaels, a climatologist who has long faulted evidence pointing to human-driven warming and is criticized in the documents.

This is also a statement that you’d expect from a climatologist building a career on a body of work disagreeing with the idea of human-driven warming. These emails are naturally material that skeptics of the human-driven climate change argument will latch onto (and, frankly, they certainly should; it’s just how scientific work is tested — through dispute).

The next several days sees a flurry of activity throughout the media and the blogosphere.

Before long, the name “Climategate” (kitschy but concise) gets attached to the discussions about the leaked materials. And since there’s a bit of both data and program source code in the mix, techies from around the world immediately jump into the fray.

One of the most popular files from the leak discussed most heavily in techie circles is called HARRY_READ_ME.txt (copies available in both original format and more structured edition). The story that unfolds in this file reveals the plight of a programmer named Harry who had struggled for three years, attempting to reproduce some research results with a collection of data and the source code for an algorithm created to calculate research conclusions.

Sadly, this man’s three-year effort to reproduce the published results with the given material never succeeded. Here’s an excerpt from the file, for a glimpse at this poor fella’s mounting frustrations along the way:

getting seriously fed up with the state of the Australian data. so many new stations have been introduced, so many false references.. so many changes that aren’t documented. Every time a cloud forms I’m presented with a bewildering selection of similar-sounding sites, some with references, some with WMO codes, and some with both. And if I look up the station metadata with one of the local references, chances are the WMO code will be wrong (another station will have it) and the lat/lon will be wrong too. I’ve been at it for well over an hour, and I’ve reached the 294th station in the tmin database. Out of over 14,000. Now even accepting that it will get easier (as clouds can only be formed of what’s ahead of you), it is still very daunting. I go on leave for 10 days after tomorrow, and if I leave it running it isn’t likely to be there when I return! As to whether my ‘action dump’ will work (to save repetition).. who knows?

Yay! Two-and-a-half hours into the exercise and I’m in Argentina!

Pfft.. and back to Australia almost immediately :-( .. and then Chile. Getting there.

Unfortunately, after around 160 minutes of uninterrupted decision making, my screen has started to black out for half a second at a time. More video cable problems – but why now?!! The count is up to 1007 though.

I am very sorry to report that the rest of the databases seem to be in nearly as poor a state as Australia was. There are hundreds if not thousands of pairs of dummy stations, one with no WMO and one with, usually overlapping and with the same station name and very similar coordinates. I know it could be old and new stations, but why such large overlaps if that’s the case? Aarrggghhh! There truly is no end in sight.

Assuming the original conclusions he was attempting to reproduce were all based on this data (and, there’s frankly no reason not to), it’s impossible to invest much confidence in their validity.

Martin points out that the data and algorithms with which Harry was working were “inherited” from a previous researcher (or researchers), and came in a poorly-organized bundle with poor documentation. And what’s worse, he didn’t have access to anyone who had originally derived the conclusions he was tasked to reproduce.1

The real egg in the face of this anecdote is the fact that CRU has clearly done an atrocious job at properly archiving their data, and documenting the work their researchers produce. Naturally this level of disorganization is a serious problem anywhere it may occur, but it’s a particularly glaring issue in the field of scientific research, where the validity of research results lies squarely upon the ability of independent third parties to reliably reproduce those results on their own. Yet here we find that the CRU is demonstrated to have either managed their data so poorly as to prevent its own scientists from being able to reproduce the organization’s own published results (in which case “embarrassing” doesn’t even begin to describe the situation), or to have manipulated the data and produced false results. And the fact is that either story tells a horrible tale about the CRU.

Charlie Martin, in a post to the Pajamas Media blog, writes:

I think there’s a good reason the CRU didn’t want to give their data to people trying to replicate their work.

It’s in such a mess that they can’t replicate their own results.

This is not, sadly, all that unusual. Simply put, scientists aren’t software engineers. They don’t keep their code in nice packages and they tend to use whatever language they’re comfortable with. Even if they were taught to keep good research notes in the past, it’s not unusual for things to get sloppy later. But put this in the context of what else we know from the CRU data dump:

  1. They didn’t want to release their data or code, and they particularly weren’t interested in releasing any intermediate steps that would help someone else

  2. They clearly have some history of massaging the data… to get it to fit their other results….

  3. They had successfully managed to restrict peer review to … the small group of true believers they knew could be trusted to say the right things.

As a result, it looks like they found themselves trapped. They had the big research organizations, the big grants — and when they found themselves challenged, they discovered they’d built their conclusions on fine beach sand.

I won’t belabor the discussion of the implications these leaked documents offer; there is no shortage of people writing about exactly that. In case you’re interested in some of the more detailed coverage of the tech community’s review of the leaked data and algorithms, I would point you to the following pieces:

There’s also some great ongoing coverage at Devil’s Kitchen.

Regardless whether or not there’s any merit to any of the CRU’s climate research, however, this little drama leaves me unable to resist repeating an argument from my last post:

But with all these arguments and assertions about corollaries, trends, and predictions that this number crunching activity will generate, it will become increasingly crucial to have a mechanism by which the results claimed to have been derived from the number-crunching can be accounted for.

It must … become incumbent upon anybody publishing findings derived from mining such data to share both the sources and processes used to derive their results or conclusions. In cases of claims rooted in the fruits of data mining endeavors, it is specifically important that results indicate:

  1. exactly which data sets it draws from, and

  2. precisely which algorithm(s) processed the data in question.

At this point, the specific implications this debacle has for the CRU’s research is irrelevant. For, whether by deceit or incompetence, this leaked data has left their published research about climate change completely unreliable.

Yet developing a confident clarity around the subject of their research remains of critical importance, for climate change is a real challenge that humankind must cope with. Regardless whether or not human industrial activity is a driving factor for climate change, the fact is that the ice at our poles is melting at an accelerating rate. Decades worth of satellite photos and other survey data sufficiently demonstrate this fact. We similarly have data collected over the last several decades by the world’s meteorologists that global mean temperatures seem to be rising, as well as increasing levels of extreme weather (from droughts and famines to floods and more) around the world.

The climate debate isn’t over whether these events are occurring, but instead whether human industrial activity accounts for a relevant piece of it.

Governments around the planet will be forced to take some sort of action to deal with the prospective repercussions of these changes (e.g., rising sea levels, expansion of the Sahara, and the rest). The consideration at stake, therefore, is how each country will individually and collectively direct their efforts and invest their resources in dealing with it.

If human industrial activity has bearing on the matter, we’ll have to make some serious policy changes and invest heavily in developing alternative methods of production, lest we imperil our own (and other) species. But if, on the other hand, our industrial activity is not a determining factor in climate change, our efforts are best spent trying to figure out how we’re going to deal with the realities of a changing climate that we cannot mitigate simply by being more responsible with our emissions.

In any case, everyone needs to make informed decisions about where they’re investing their money and efforts.

And so a number of the world’s governmental and industrial leaders (including US President Barack Obama) are scheduled to meet — along with members of the climate research community — at the United Nations Climate Change Conference in Copenhagen this December in an attempt to work out policy directions to deal with climate change. I’m hoping the event will focus on methods to improve and reinforce confidence in the remainder of the climate research work being conducted around the world, and that it won’t turn into a political food fight.

Fingers crossed.

I am left hoping that some real good can rise from this mess. And so I call on climate change researchers and institutions around the world to take this opportunity develop the practice of providing full disclosure on the sources of their data sets and the functionality of their algorithms. There will likely be many political, legal, and logistical obstacles to address and overcome in this effort, but failure to do so carries stakes that are simply too high.

Footnotes

  1. I personally have plenty of experience attempting to work with poorly-documented code and data inherited from some previous person’s work, and can directly attest to the maddening up-hill battle of that situation.

General Thoughts , , , ,

A Fear of the People

May 24th, 2009

The Iranian government has blocked access to Facebook. This in the run up to its June 12 presidential elections.

The blessing hiding behind the headline to this story is that this action boldly demonstrates just how much Ahmadinejad’s administration fears the thought of what may be possible if its governed citizens have the opportunity to communicate with each other in such a public and open forum.

Good luck ousting the douche bag, Iran.

General Thoughts , , , ,

Apple Launches A Revolution… and Then Gets Overtaken?

May 4th, 2009

At least, that’s what Richard Wray and Bobbie Johnson of the Guardian conclude:

… while Apple caused a revolution [with the iPhone], it is unlikely to become dominant in the market. It has sold just over 20m iPhones since the first device appeared in 2007; in that time more than 1.5bn phones have been shipped by everyone else. A similar thing happened with the personal computer market. The concept was championed by Apple when it launched Apple II, the worlds first personal computer, in 1977, and the first Macintosh in 1984, but other players now lead the market.

This argument — whose conclusion, for some reason, hinges strictly on unit sales of the iPhone units sold vs. the rest of the phones in the market as a metric for performance — overlooks several critical points:

  1. Looking at sales numbers for all phones worldwide is meaningless; rather an examination of the so-called “smart phone” sales numbers would make for more meaningful insights.
  2. The iPhone isn’t available in every market — in order to make a meaningful point about its sales performance, they ought to (at least also) isolate its relative performance in the markets the product is actually available in.
  3. The rest of their competitors have been selling devices for a decade or more.

I guess it makes for a headline that gets the click-throughs, but since their conclusion is antithetical to the understanding everyone else looking at the iPhone’s market performance has, I would suggest merely that I’d like to see them support their assertion with something slightly more compelling and meaningful than comparing iPhone sales versus the rest of mobile phones on the planet.

Responses , ,

Palm Pre-Fucking Themselves?

April 30th, 2009

TechCrunch is reporting that Palm is already creating a second WebOS device, code named Pixie, that will be a “low end” version of its upcoming (and as-yet unreleased) Pre.

From the post:

The Pixie will use the same WebOS operating system and software as the Palm Pre, but in a smaller candy-bar form factor and a target $99 price point, say our sources. It will be released only a few months after the Pre in June, so this isn’t an upgrade device. It’s targeted at the lower end of the market.

I’m sorry, but this is all sorts of stupid.

For one, prove the concept first. Get the software out there, see what the initial responses are, and refine a bit. See if it sells. See what is and isn’t not working just right; it’s entirely possible a fundamental design decision of the hardware somehow needs to be reconsidered based in this information. Then — with some market reaction data in hand — think about a different flavor of hardware that could at least have the benefit of being informed by response to the initial model.

Secondly, they’re launching a new platform which will have to court developers into its fold with two different screen sizes. Developers interested in targeting both will therefore have a two-device testing matrix out of the gates. Some will go for the full-size features of the Pre, and others will target the Pixie.

Finally, but most importantly, they’re going to fracture consumer interest. Bringing two similar products to market like this will force prospective consumers to ask themselves which one they want, and whether they “really need” whatever’s in the “big” one. In these times, more consumers are likely to steer in an economical direction, given what may well be a relatively equal set of choices, and since Palm would presumably use lower-powered hardware for the Pixie, the general consumer experience will likely feel more constrained.

I’m scared to death that Palm is killing the most promising new smartphone announced in a little over two years, before the poor thing has even come into the market.

There seems to be some question as to whether the project is completely green-lit, as Arrington mentions:

One source says it’s full steam ahead. Another says Palm is waiting to see how the Pre does before announcing the Pixie.

But, frankly, letting the information leak out like this (and at least having someone think this effort is “full steam ahead”) is enough to every bit as much harm as the actual release of the product; the anticipators will begin to demand its release, and start to play a wait-and-see, betting that Palm eventually will drop the Pixie.

And the pundits will buzz about it, and Palm will see weak sales on its Pre, and cave into the expectation of the Pixie’s release.

As I’d previously said: as die-hard an iPhone fan as I indeed am, I have been keenly looking forward to the Pre, both as a product of wonder and accomplishment, and as some great and much-needed competition for the iPhone, which will only make my favorite phone better, in the long-run.

Palm: good luck.

30 Apr @ 14:34: Let me add that this, of course, is merely rumor at this time; Palm has not officially announced anything. That said, TechCrunch has another post claiming to have dug up even more details. I would note that the claimed “shaky” photo of the Pixie in my original link above differs significantly in proportion from what is shown on the photo in this latest post.

Really? , ,

Mesopotamia 2.0

April 21st, 2009

A bunch of Silicon Valley execs are in Iraq, apparently “explore new opportunities to support Iraqi government and non-government stakeholders in Iraq’s emerging new media industry.”

From the press release:

The delegation [...] will provide conceptual input as well as ideas on how new technologies can be used to build local capacity, foster greater transparency and accountability, build upon anti-corruption efforts, promote critical thinking in the classroom, scale-up civil society, and further empower local entities and individuals by providing the tools for network building.

I can’t decide whether this is pure genius or utter madness. [Via TechCrunch]

Check it out , ,

The Cram

April 3rd, 2009

In contrast with the spirit of yesterday’s link to Designing Convertbots application comes news of the confirmed continuation of effort to bring Microsoft Office to the iPhone.

Absent an iPhone OS device with a significantly larger screen, I can’t imagine any purpose to cramming the UI of Word (or Excel!) into as space with the iPhone’s screen real estate. If Microsoft had any track record for pulling off surprisingly excellent (or even simple and focused) UI solutions, I’d leave room for a verdict.

Who knows? Maybe they’ll actually impress this time.

UI Design , , ,

A Perfect 10 (potentially… down the line)

February 17th, 2009

So Courtney Gaines, of the Telegraph’s “Gadget Inspectors” gadget reviews series rates the HTC G1 (the first Google phone) with a a 10/10 rating, citing capabilities that are theoretically possible in upcoming Android-based phones… what?!

Really? , , ,

John Gruber’s “Interface of the Week”

December 8th, 2008