Coming from the data mining and machine learning field, a few friends of mine have been kicking around the idea of whether data or analysis is a more valuable asset. Obviously the two have a symbiotic relationship to be valuable together. If you can have both you’ll take it, but what if you had to choose? Is it generally true that in most areas of business you have to specialize - which means prioritizing certain activities, skills, or knowledge over others. Should you choose to get better access to the data or should you choose to get better at analyzing whatever data you have available to you? If you’re in an analytics heavy field - is there an inevitability that one or the other will win in the long run? If not, under what conditions might one be the better option?
I originally started thinking of this in terms of “Which would cost you more if you had to buy it?” Would it be cheaper to have the ability to analyze data well, and have to “purchase” the data from someone else - or would it be cheaper to have the ability to obtain, store, and access the data well, and have to purchase the analysis from someone else? Microeconomics says that value is all about scarcity, so this line of thinking led me to conclude that decent analysts will always be cheaper (on a for-hire basis) than access to all but the most plentiful data. Therefore, if the data is abundant then it might make sense to specialize in its analysis, but under almost any other circumstances you should choose to be the one who has better access to the data. Point for data. Algorithm design, analysis, whatever you want to call it is basically a particular example of human ingenuity. Therefore at any moment in time, someone has come up with the “best” way to analyze some data for a particular purpose. However, innovative people all over the world are hard at work to put that person’s idea out to pasture even before the idea has been tested. Therefore, cutting edge analysis has two properties relevant to this discussion. First, it can increase the value of the data by a quantum leap when a new theory emerges. This makes it a very valuable contribution for a period of time. However, inevitably, how long this lasts is hard to predict. It could be the decisive contribution for a year, five years, a decade, or a few decades. The only thing we know for sure is that it will not be king forever. Still, whoever owns the best analysis in the world for a given situation is arguably bringing a very unique value contribution to the table and if protected properly can be quite scarce. Point analysis.
Brad Burnham at Union Square Ventures considered the problem in a much better way. He posits that data has an increasing marginal utility. This is a very important characteristic which says that every additional piece of data you get is more valuable than the previous one you got. Why? Because with proper analysis, you can tie that single new piece of information together with potentially all of your previous data. This network effect of data is very insightful. It says that knowing your location and knowing what you search for on the web are both valuable in their own right, but knowing them together is even better (think relevant & localized ads). Nice work Brad. Another point for the data.
What about analysis, does it have any “network effects” or compounding effects to counter the effects we just saw that collecting more data has? I think it might. The data people have a cost structure problem. The cost of getting & storing a lot of data can be quite high. The trend is that data from transaction or event monitoring is growing exponentially. The cost to store the data is also declining exponentially, which is good news for the data guys, it means they have a chance at least. However, the costs to access & transmit the data is not decreasing nearly as fast. Also, the cost to power (literally, in terms of electricity) the storage systems is becoming a problem, and not decreasing nearly fast enough to be meaningful. Point analysis.
The more I think about, my gut instinct says that data is the only way to keep a lasting competitive advantage. However, if you want to make a quick strike against your competition, it seems that analysis may be the way to go. Perhaps the right strategy is to try to shift from one to the other? This could definitely use some more consideration. I’m going to be on the look out for some good case studies out there. Maybe Google, eBay, and even the phone companies provide good examples on what to care about here.