<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Signal/Noise &#187; analytics</title>
	<atom:link href="http://billpetti.com/tag/analytics/feed/" rel="self" type="application/rss+xml" />
	<link>http://billpetti.com</link>
	<description>Trying to separate the signal from the noise, one post at a time.</description>
	<lastBuildDate>Fri, 30 Sep 2011 21:49:18 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='billpetti.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/6cef924e9e2296437300917a41fb5f9c?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Signal/Noise &#187; analytics</title>
		<link>http://billpetti.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://billpetti.com/osd.xml" title="Signal/Noise" />
	<atom:link rel='hub' href='http://billpetti.com/?pushpress=hub'/>
		<item>
		<title>Book Review: The Numbers Game</title>
		<link>http://billpetti.com/2010/11/30/book-review-the-numbers-game/</link>
		<comments>http://billpetti.com/2010/11/30/book-review-the-numbers-game/#comments</comments>
		<pubDate>Tue, 30 Nov 2010 17:49:03 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[baseball]]></category>
		<category><![CDATA[book review]]></category>
		<category><![CDATA[management]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://billpetti.com/?p=2930</guid>
		<description><![CDATA[Alan Schwarz&#8217;s The Numbers Game is an indispensable look at how the numbers that have come to define the game of baseball came to be.  The book is less about the hallowed numbers that even casual fans can identify; Aaron&#8217;s 755 home runs, DiMaggio&#8217;s 56 game hit-streak, Nolan Ryan&#8217;s 5714 strikeouts, Cy Young&#8217;s 511 wins, Pete Rose&#8217;s [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=2930&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft" src="http://ecx.images-amazon.com/images/I/512K9H0ENZL._SL500_.jpg" alt="" width="147" height="240" />Alan Schwarz&#8217;s <em><a href="http://amzn.to/bug9L9">The Numbers Game</a></em> is an indispensable look at how the numbers that have come to define the game of baseball came to be.  The book is less about the hallowed numbers that even casual fans can identify; Aaron&#8217;s 755 home runs, DiMaggio&#8217;s 56 game hit-streak, Nolan Ryan&#8217;s 5714 strikeouts, Cy Young&#8217;s 511 wins, Pete Rose&#8217;s 4256 hits, Rickey Henderson&#8217;s 1406 stolen bases, etc.  Instead, Schwarz looks back over time to reconstruct how specific statistics were created and how those statistics were subsequently accepted as the definitive measurements of player performance.  The book will definitely appeal to diehard fans of baseball and those that love to analyze the game. However, much like its contemporary, Michael Lewis&#8217; <em><a href="http://bit.ly/8TczPA">Moneyball</a>, </em>Schwarz&#8217;s book provides insights into the management and analysis of any organization.</p>
<p>Schwarz traces the history of baseball&#8217;s obsession with statistics to Henry Chadwick, a journalist and baseball writer widely acknowledged as the grandfather of baseball statistics.  Chadwick&#8217;s work in the mid- to late-19th century laid the foundation for much of the statistical framework through which we appreciate the game today.  Chadwick was adamant that the new game of baseball required a fair accounting of player performance:</p>
<blockquote><p>In order to obtain an accurate estimate of a player&#8217;s skill, an analysis, both of his play at bat and in the field, should be made, inclusive of the way in which he was put out; and that this may be done, it is requisite that all&#8230;contests should be recorded in a uniform manner.</p></blockquote>
<p>Anyone who has paid even scant attention to the debate regarding traditional and sabermetric approaches to the game of baseball will recognize Chadwick&#8217;s logic&#8211;not only do position players contribute to their team&#8217;s success by creating runs through their offense; they can also prevent runs by the other team through their defense.  Traditionally, players came to be valued and compensated based largely on their offensive production.  When it came time to arbitrate salaries or negotiate free agent contracts, offensive statistics carried the most weight.  (Whether or not the offensive statistics being used were the most accurate is a much larger debate, and Schwarz gives ample space to this history as well).</p>
<p>As the analysis of baseball became more sophisticated, analysts were finally able to measure a player&#8217;s total value by incorporating runs produced through offense as well as those saved by defense.  Rather than relying on the traditional fielding percentage (which simply measured the number of chances a fielder converted to an out), more sophisticated measures allowed for talent evaluators to look at how many runs a player saved in the field.  Power hitting shortstops surely contributed to their team&#8217;s success by creating runs, but their light hitting counterparts could conceivable contribute just as much by saving runs.</p>
<p>Chadwick&#8217;s quote highlights two critical issues for any organization, and it is a theme that runs through <em>The Numbers Game</em>; ensuring that the metrics you rely on account for all of the ways a person contributes to success and that the data used to calculate those metrics is collected in a consistent, uniform manner.  <span id="more-2930"></span>For years, teams overpaid for free agents whose offensive statistics were not viewed in the context of their defensive statistics.  In some cases, a player&#8217;s offensive production was either tempered or cancelled out by their poor defensive play&#8211;in short, they were giving up as many runs with their poor defense as they were earning through their offense.  While many organizations today have embraced the &#8220;analytics revolution&#8221; one wonders to what extent they truly know what metrics matter.  Like traditional owners and managers in baseball, it is likely that many businesses assume they know what the proper metrics are&#8211;what behaviors contribute the most to success.  The lesson from baseball is that accepted wisdom, personal experience, and &#8220;gut feelings&#8221; may be right, but they must be subjected to rigorous analysis to determine the best way to evaluate performance and drive success.</p>
<p><em>The Numbers Game </em>is full of examples of how certain metrics came to be the accepted wisdom and the difficulty of any endeavor to unseat them in favor of better measures.  Besides baseball fans, anyone who is leading a business analytics initiative or responsible for performance and compensation will find this book rewarding.</p>
<br /> Tagged: <a href='http://billpetti.com/tag/analytics/'>analytics</a>, <a href='http://billpetti.com/tag/baseball/'>baseball</a>, <a href='http://billpetti.com/tag/book-review/'>book review</a>, <a href='http://billpetti.com/tag/management/'>management</a>, <a href='http://billpetti.com/tag/statistics/'>statistics</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/2930/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/2930/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/2930/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/2930/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/2930/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/2930/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/2930/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/2930/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/2930/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/2930/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/2930/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/2930/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/2930/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/2930/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=2930&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2010/11/30/book-review-the-numbers-game/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>

		<media:content url="http://ecx.images-amazon.com/images/I/512K9H0ENZL._SL500_.jpg" medium="image" />
	</item>
		<item>
		<title>The Key to Fantasy Glory is Consistency (well, that&#8217;s what I am betting on this year, anyway)</title>
		<link>http://billpetti.com/2010/08/30/the-key-to-fantasy-glory-is-consistency-well-thats-what-i-am-betting-on-this-year-anyway/</link>
		<comments>http://billpetti.com/2010/08/30/the-key-to-fantasy-glory-is-consistency-well-thats-what-i-am-betting-on-this-year-anyway/#comments</comments>
		<pubDate>Mon, 30 Aug 2010 20:18:59 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[fantasy football]]></category>
		<category><![CDATA[moneyball]]></category>

		<guid isPermaLink="false">http://billpetti.com/?p=2633</guid>
		<description><![CDATA[It&#8217;s that time of year again.  The passing of the summer, the start of the fall.  Most importantly, it signals the start of that most magical of times&#8211;the start of the fantasy football season. This year I am in three leagues&#8211;one for work, one for &#8220;stat-geeks&#8221;, and one for family and friends.  I decided to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=2633&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s that time of year again.  The passing of the summer, the start of the fall.  Most importantly, it signals the start of that most magical of times&#8211;the start of the fantasy football season.</p>
<p>This year I am in three leagues&#8211;one for work, one for &#8220;stat-geeks&#8221;, and one for family and friends.  I decided to tweak the way I approached the draft this year and wanted to share a bit of the strategy with readers.</p>
<p>One of the biggest problems with standard pre-draft rankings, particularly those of the big fantasy hosting sites (e.g. ESPN, CBS, etc) is that the rankings are based solely on aggregate measures of performance such as total projected points for the upcoming season.  Now, of course the goal is to assemble a team with players that end up scoring lots of points throughout the season, but total points scored ignores the fact that teams compete head-to-head, week-to-week.  In order to make the playoffs a team has to outscore opponents on a consistent basis in order to accumulate wins, not just points.  That means drafting players that not only score a lot of points, but score a lot of points week in and week out.  When it comes to deciding between which players to draft, managers would be better off selecting consistent scorers versus boom-or-bust players (at least, that is my hypothesis).</p>
<p>Let&#8217;s take a look at two hypothetical players: <span id="more-2633"></span></p>
<p style="text-align:center;"><img class="size-full wp-image-2645 aligncenter" title="Player A_Player B" src="http://billpetti.files.wordpress.com/2010/08/player-a_player-b.jpg?w=468" alt=""   /></p>
<p>Over the course of four weeks both players score the same amount of total points.  However, Player A is clearly a boom-or-bust player while Player B is more consistent week-to-week.  Player A gives you a great chance to win Weeks 1 and 4, but makes it much hard to win in Weeks 2 and 3.  On the other hand, Player B is the model of consistency, giving you a great chance to win each week.  On most pre-draft rankings, Players A and B will look like equally valuable picks, but this is misleading.</p>
<p>This year, I decided to see whether a player&#8217;s penchant for boom-or-bust performances was at all consistent and predictable.  The initial answer seems to be yes.</p>
<p>I developed two metrics; one to evaluate high scoring consistency and one that takes predicted points and combines them with scoring consistency.  The first, ConBoom, measures, weights, and then combines the number of times a player scored &gt;=20 points, &gt;=15 points, and &lt;10 points per game over the course of a season.  This is the foundation of the consistency metric.  ConRank combines the ConBoom score for a player with a weighted measure of that player&#8217;s predicted total points for the upcoming season.  (How am I weighting each component of the measures?  Well now I can&#8217;t reveal the entire secret sauce, now can I?)</p>
<p>I validated the measures against the past three years of actual player data and found that ConBoom scores from one year were highly correlated with ConBoom scores the next year (.70).</p>
<p>I am going to use the new metric to guide my drafts in all three leagues and essentially test how my teams fair against other teams over the course of the season.  With data and predictions for every player I&#8217;ll be able to test the method over three league scoring systems, 32 teams, and 256 games as well as validate the measures predictive attributes over another year.</p>
<p>Draft number one is tonight.  Let the games begin!</p>
<br /> Tagged: <a href='http://billpetti.com/tag/analytics/'>analytics</a>, <a href='http://billpetti.com/tag/fantasy-football/'>fantasy football</a>, <a href='http://billpetti.com/tag/moneyball/'>moneyball</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/2633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/2633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/2633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/2633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/2633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/2633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/2633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/2633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/2633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/2633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/2633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/2633/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/2633/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/2633/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=2633&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2010/08/30/the-key-to-fantasy-glory-is-consistency-well-thats-what-i-am-betting-on-this-year-anyway/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>

		<media:content url="http://billpetti.files.wordpress.com/2010/08/player-a_player-b.jpg" medium="image">
			<media:title type="html">Player A_Player B</media:title>
		</media:content>
	</item>
		<item>
		<title>What gets measured (and valued) gets done</title>
		<link>http://billpetti.com/2010/06/30/what-gets-measured-and-valued-gets-done/</link>
		<comments>http://billpetti.com/2010/06/30/what-gets-measured-and-valued-gets-done/#comments</comments>
		<pubDate>Wed, 30 Jun 2010 09:18:08 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Business]]></category>
		<category><![CDATA[management]]></category>

		<guid isPermaLink="false">http://billpetti.com/?p=2340</guid>
		<description><![CDATA[Most everyone has heard Peter Drucker&#8217;s famous dictum &#8220;What gets measured gets done&#8221;, the implication being that unless specific behaviors and outcomes are measured they aren&#8217;t likely to be given much attention.  In this month&#8217;s HBR, Dan Ariely suggests that this notion of measurement-driving-behavior explains many of the problems with current CEO behavior and suggests [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=2340&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>Most everyone has heard Peter Drucker&#8217;s famous dictum &#8220;What gets measured gets done&#8221;, the implication being that unless specific behaviors and outcomes are measured they aren&#8217;t likely to be given much attention.  In this month&#8217;s HBR, <a href="http://hbr.org/2010/06/column-you-are-what-you-measure/ar/1" target="_blank">Dan Ariely suggests</a> that this notion of measurement-driving-behavior explains many of the problems with current CEO behavior and suggests a solution:</p>
<blockquote><p>A loose consensus has formed around the idea that basing CEO pay on, say, five years of stock returns would eliminate some of the reckless decision making that led to the Great Recession. But I suspect that even if you could build a compensation plan that focuses on long-term shareholder value, you’d solve only part of the problem.</p>
<p>That’s because such a scheme still ties CEOs’ motivation to one fickle number—company share price—and assumes that pay alone motivates chief executives to perform.</p>
<p>Any number of things can motivate CEOs—peer recognition, for example, and even a desire to change the world. In fact, CEOs usually have all the money they need. Why then does it seem that they care more about stock value and the compensation it produces than those other forms of motivation?</p>
<p>The answer is almost uncomfortably simple: CEOs care about stock value because that’s how we measure them. If we want to change what they care about, we should change what we measure.</p></blockquote>
<p>Ariely goes on to argue that, in general, people align their behaviors with the criteria that others judge them on.  He suggests a number of additional metrics that CEO&#8217;s performance should be judged by, such as jobs created, pipeline of new products and patents, satisfaction level of customers, and trust in your company and brand, but says that these are difficult to measure.</p>
<p>I largely agree with Ariely, but slightly disagree on two points: <span id="more-2340"></span></p>
<p>First, it isn&#8217;t simply that what gets measured gets done, but rather what gets measured <strong><em>and valued</em></strong> gets done.  It is a slight difference, but an important one.  Businesses today are flush with data, statistics, metrics, and dashboards.  The decision as to which of those metrics to value, i.e. to use as a measuring stick of employee (and, in this case, CEO) success is an important one.  Take an example from baseball.  For decades, managers, talent scouts, and front office executives paid little attention to walks and on-base percentage.   Both were measured, but they weren&#8217;t properly valued&#8211;players with a high number of walks and a robust on-base percentage were not compensated at a high level because of their performance in these two areas.  It isn&#8217;t enough to measure&#8211;specific measures must also be publicly valued in order to affect behavior in the way that Ariely suggests.  The issue of what gets valued is an issue of culture change to a large extent&#8211;it certainly isn&#8217;t easy, but without it measurement will not affect behavior.</p>
<p>Second, many of the items that Ariely suggests are quite measurable.  My current employer, Gallup, is a global leader in the science of measuring customer satisfaction (or, as we would argue, <a href="http://www.gallup.com/consulting/49/Customer-Engagement.aspx" target="_blank">customer engagement</a>) and brand confidence or trust.  Specifically, Gallup has developed metrics for both that have been validated in relation to business outcomes (e.g. customer retention, profitability, etc).  Often times we assume that things are inherently difficult or impossible to measure and, therefore, fail to measure them at all (to be fair, Ariely doesn&#8217;t suggest this, he simply states that it is more difficult).</p>
<br /> Tagged: <a href='http://billpetti.com/tag/analytics/'>analytics</a>, <a href='http://billpetti.com/tag/business/'>Business</a>, <a href='http://billpetti.com/tag/management/'>management</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/2340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/2340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/2340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/2340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/2340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/2340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/2340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/2340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/2340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/2340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/2340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/2340/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/2340/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/2340/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=2340&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2010/06/30/what-gets-measured-and-valued-gets-done/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>
	</item>
		<item>
		<title>The Era of Big Data: IBM Gets It</title>
		<link>http://billpetti.com/2010/04/19/the-era-of-big-data-ibm-gets-it/</link>
		<comments>http://billpetti.com/2010/04/19/the-era-of-big-data-ibm-gets-it/#comments</comments>
		<pubDate>Mon, 19 Apr 2010 12:15:48 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Big Data]]></category>

		<guid isPermaLink="false">http://billpetti.com/?p=1959</guid>
		<description><![CDATA[I&#8217;ve written before about how IBM dove headfirst into the world of Big Data.  They&#8217;ve made a big bet on the revolutionary possibilities available to business, governments, and individuals given the revolution in data capture and analytics we are entering.  At this point you&#8217;ve all seen this point made in various ways through IBM&#8217;s Smarter Planet [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=1959&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve written before about how <a href="http://billpetti.com/2009/11/16/cloud-analytics-from-big-blue/" target="_blank">IBM dove headfirst</a> into the world of <a href="http://billpetti.com/2009/08/07/profiting-from-an-analytically-driven-world/" target="_blank">Big Data</a>.  They&#8217;ve made a big bet on the revolutionary possibilities available to business, governments, and individuals given the revolution in data capture and analytics we are entering.  At this point you&#8217;ve all seen this point made in various ways through IBM&#8217;s <a href="http://www.ibm.com/smarterplanet/us/en/" target="_blank">Smarter Planet campaign</a>.  Here&#8217;s one of the ads where they make their case for why data matters:</p>
<span style="text-align:center; display: block;"><a href="http://billpetti.com/2010/04/19/the-era-of-big-data-ibm-gets-it/"><img src="http://img.youtube.com/vi/AnL98lQdqa8/2.jpg" alt="" /></a></span>
<p>Combine sophisticated data analytics with the fact that <a href="http://www.aolnews.com/science/article/scientists-make-it-official-people-are-so-predictable/19364257" target="_blank">people, it turns out, are actually rather predictable</a> in their routines and you end up with a myriad of possibilities for business and public policy.</p>
<p>Via <a href="http://twitter.com/alexlundry/status/10735720137" target="_blank">Alex Lundry</a></p>
<p>[Note: I will be traveling over the next few days, so posting will be light-to-nonexistent]</p>
<br /> Tagged: <a href='http://billpetti.com/tag/analytics/'>analytics</a>, <a href='http://billpetti.com/tag/big-data/'>Big Data</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/1959/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/1959/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/1959/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/1959/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/1959/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/1959/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/1959/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/1959/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/1959/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/1959/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/1959/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/1959/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/1959/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/1959/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=1959&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2010/04/19/the-era-of-big-data-ibm-gets-it/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>
	</item>
		<item>
		<title>Cloud Analytics from Big Blue</title>
		<link>http://billpetti.com/2009/11/16/cloud-analytics-from-big-blue/</link>
		<comments>http://billpetti.com/2009/11/16/cloud-analytics-from-big-blue/#comments</comments>
		<pubDate>Mon, 16 Nov 2009 13:46:20 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://billpetti.com/?p=1225</guid>
		<description><![CDATA[Music to analytically-driven ears: [...] IBM is unveiling a new internal analytics product that the company is touting as the “largest private cloud computing environment for business analytics in the world,” which launches internally with more than a petabyte of information. Along with this internal product, IBM will launch a companion product for clients to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=1225&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><a href="This is music to my analytically-driven ears" target="_blank">Music to analytically-driven ears</a>:</p>
<blockquote><p>[...] IBM is unveiling a new internal analytics product that the company is touting as the “largest private cloud computing environment for business analytics in the world,” which launches internally with more than a petabyte of information. Along with this internal product, IBM will launch a companion product for clients to build upon this cloud-based architecture, called IBM Smart Analytics Cloud.</p>
<p>The internal product, dubbed Blue Insight, will provide 200,000 employees in IBM’s sales and development department with the ability to extract data and information to make decisions and gain further insight at the point of sale. Blue Insight will gather information from nearly 100 different information warehouses and data stores, providing analytics on more than a petabyte (1,000 terabytes or 1,000,000 gigabytes) of data. For example, sales execs may use customizable queries of real time data to understand revenue opportunities and how many sales in their region are closing to help improve prediction. Or a manufacturing process engineer can evaluate real-time data on the plant floor to identify trends and data to improve yield and reduce shipment delivery times.</p>
<p>IBM Smart Analytics Cloud offering for clients will similarly deliver powerful business intelligence via the scalable, private cloud. The product lets the client import data and than transform this information into insights to develop strategies and decisions. The service will offer the ability to create reports, analysis, dashboards, and scorecards to monitor business performance and measure results.</p></blockquote>
<p>The key to good analytics is not just the accumulation of relevant data, but also the ability to manipulate and visualize the data in meaningful ways.  The ability to create custom dashboards, etc, is an important part of making analytics a useful and profitable tool.  IBM has certainly dedicated itself to expanding its service and software offerings relative to its traditional hardware business, and analytics has emerged as a huge part of that strategy.  In doing so, it should be well positioned to profit from the coming <a href="2009/08/07/profiting-from-an-analytically-driven-world/" target="_blank">analytically-driven</a> <a href="2009/08/14/more-on-a-data-driven-world/" target="_blank">world</a>.</p>
<br /> Tagged: analytics, Big Data, statistics <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/1225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/1225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/1225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/1225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/1225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/1225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/1225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/1225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/1225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/1225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/1225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/1225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/1225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/1225/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=1225&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2009/11/16/cloud-analytics-from-big-blue/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>
	</item>
		<item>
		<title>Evaluating Human Capital Investments Through the Prism of Baseball</title>
		<link>http://billpetti.com/2009/10/25/evaluating-human-capital-investments-through-the-prism-of-baseball/</link>
		<comments>http://billpetti.com/2009/10/25/evaluating-human-capital-investments-through-the-prism-of-baseball/#comments</comments>
		<pubDate>Sun, 25 Oct 2009 14:00:48 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[baseball]]></category>
		<category><![CDATA[human capital]]></category>
		<category><![CDATA[momentum]]></category>
		<category><![CDATA[performance metrics]]></category>
		<category><![CDATA[Sabermetrics]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://billpetti.com/?p=858</guid>
		<description><![CDATA[[Fair warning: what follows is quite lengthy] Well, it is performance review time at work and this reminded me of a post I&#8217;ve been meaning to write for a while. An issue that has always interested me is how organizations measure individual performance.  Organizations have finite resources and therefore must deploy those resources in the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=858&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><em>[Fair warning</em><em>: what follows is quite lengthy]</em></p>
<p>Well, it is performance review time at work and this reminded me of a post I&#8217;ve been meaning to write for a while.</p>
<p>An issue that has always interested me is how organizations measure individual performance.  Organizations have finite resources and therefore must deploy those resources in the most efficient manner, maximizing their value.  Given the large percentage of resources invested in personnel, organizations have a huge incentive to get those investments right.  However, <a href="http://www.amazon.com/gp/product/0071422536?ie=UTF8&amp;tag=billpett-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=0071422536">calculating an accurate ROI on employees</a> is probably one of the hardest things to do.  To explore why&#8211;and how it might be done better&#8211;I turn to the world of sports, baseball in particular.</p>
<p>There has long been a debate within the <a href="http://www.sabr.org/" target="_blank">Sabermetric community</a> (and between purists and Sabermetricians) regarding the statistical relevance of &#8220;clutch&#8221;: the ability of a player to elevate their performance in key situations in a manner significantly different from their performance in normal situations.  Early research by such pioneers as Bill James found that the attribute of clutch didn&#8217;t exist&#8211;much like the idea of a &#8220;hot-hand&#8221; in basketball, the appearance of a clutch performance (e.g. a usually mediocre batter managing to hit .500 in a playoff series) was nothing more than a statistical artifact.  If you were to look at any 5-7 game stretch during the 162-game regular season you are just as likely to find even average hitters going 4 for 8 or 8 for 16 as you would be to find them going 2-8 or 0-16.  Over a long enough time period, these streaks even out and players <a href="http://en.wikipedia.org/wiki/Regression_toward_the_mean" target="_blank">regress to their mean</a> performance.  Basically, if you are a .250 hitter, over a long enough period of time your performance will return to its mean, despite occasional swings to the extreme left and right sides of the bell curve.</p>
<p>More recent studies have looked to expand upon earlier research and refine how we search for clutch performances.  A common way to do this is to not look at single games that were more important (e.g. post-season play), but rather particular <em>moments </em>that alter the probability of a team winning <em>that particular game</em>.  This approach has been termed &#8220;<a href="http://www.hardballtimes.com/main/statpages/glossary/#li" target="_blank">leverage</a>&#8220;:<span id="more-858"></span></p>
<blockquote><p>the <strong>swing</strong> in the possible <strong>change</strong> in win probability. If there is a game with one team leading by ten runs, the possible changes in win probability, whether the event is a home run or a double play, will be very close to negligible. That is, there won&#8217;t be much swing in any direction.</p>
<p>But, in a late and close game, the change in win probability among the various events will have rather wild swings. With a runner on first, two outs, down by one, and in the bottom of the ninth, the game can hinge on one swing of the bat—a home run and an out will both end the game, but with vastly different outcomes for the teams involved.</p></blockquote>
<div class="wp-caption alignleft" style="width: 310px"><img title="Via ESPN.com" src="http://a.espncdn.com/photo/2008/1001/pg2_g_rodriguez_jeter_300.jpg" alt="From left to right: A-Rod and Jeter" width="300" height="300" /><p class="wp-caption-text">From left to right: A-Rod and Jeter</p></div>
<p>An excellent example of this debate over which players are clutch and which are not is <a href="http://blogs.wsj.com/numbersguy/alex-rodriguez-clutch-hitter-411/" target="_blank">the case of Alex Rodriguez</a>.  Long touted as one of the greatest hitters of all time, A-Rod has consistently been criticized as a non-clutch performer, accused of accumulating the bulk of his stats in low-leverage (i.e. low-pressure) situations.  One could not listen to NY sports radio over the past 5 years without hearing fans decry the &#8216;superficial&#8217; performance of the Yankee slugger while praising the clutchness of his statistically less impressive teammate, Derek Jeter.  Part of the reason for this perception is <a href="http://mlb.mlb.com/media/video.jsp?mid=200807113113172" target="_blank">the effect of high-impact events</a> on perception and the framing of the debate by the media.  But when we look at both players&#8217; statistics in leveraged situations <a href="http://blogs.wsj.com/numbersguy/alex-rodriguez-clutch-hitter-411/" target="_blank">a much different picture emerges</a>:</p>
<blockquote><p>[As of 2008] Mr. Rodriguez has hit for the clutch throughout his remarkable, surefire Hall of Fame career. His career OPS in high-leverage situations is .975. In medium-leverage, it’s .960. And in low-leverage, it’s .972. That’s consistent with the American League as a whole during his career, when each year batters in high-leverage situations hit somewhere between 1% worse and 6% better than they did in low-leverage situations.</p></blockquote>
<p>Additionally, if you look at the playoffs (where most look for clutch performance), A-Rod has accumulated the following statistics (including this year): .299BA/.393OBP/.958OPS.  Here&#8217;s Jeter&#8217;s: .309BA/.379OBP/.858OPS.  In series where his team is playing for a spot in the World Series, A-Rod&#8217;s stats are even better&#8211;Jeter&#8217;s go south.</p>
<p>The point is not to cheerlead for  Alex Rodriquez, but rather to point out two points that can be applied to any organization and field when thinking about the value of personnel:</p>
<ol>
<li>When evaluating performance we should pay attention to results over the long term, not an arbitrary chunk of time.  Anyone can &#8216;get hot&#8217;, but that doesn&#8217;t mean the individual willed themselves to a better performance or that current success is a predictor of future success.  Discrete outcomes are quite dependent on factors outside of an individual&#8217;s control as well as the randomness of performance.  This works both ways&#8211;for successes and for shortcomings.</li>
<li>Single, high-impact events can significantly skew our perspective.  Objectively evaluating the quality of individuals is difficult since we tend to use anecdotes as mental shortcuts.  Big wins are important, but they shouldn&#8217;t blind us to the overall performance of an individual.  Those big moments could easily be the result of random chance rather than talent.  Now, we all benefit from luck from time to time.  However, much time and treasure has been wasted on individuals who &#8216;came up big&#8217; at one time and management is just waiting for them to rise to the occasion yet again.  This applies equally to &#8216;hot&#8217; baseball prospects, CEO&#8217;s, coaches, sales personnel, programmers, managers, etc.</li>
</ol>
<p>So what should we do?  How do we better measure human capital performance?  I think partly we should take a page from baseball.  New measurements have emerged over time to better evaluate player performance.  Taking an analytical approach to measuring performance is step one.  It shouldn&#8217;t be the only measurement, but without appropriate data organizations are simply basing their evaluations on subjectivity and intuition&#8211;you need a balance.  With new measurements comes a new mindset; the prism through which we view the world changes so that instead of looking for those &#8216;big moments&#8217; we now look for consistent performance over the long term.  A healthy appreciation for randomness, chance, and methods for accurately measuring performance are all good first steps towards more accurately evaluating human capital.</p>
<p>For those that made it this far, thanks for sticking with me on this one.  Would love to hear your thoughts.</p>
<br /> Tagged: analytics, baseball, human capital, momentum, performance metrics, Sabermetrics, statistics <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/858/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/858/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/858/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/858/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/858/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/858/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/858/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/858/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/858/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/858/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/858/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/858/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/858/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/858/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=858&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2009/10/25/evaluating-human-capital-investments-through-the-prism-of-baseball/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>

		<media:content url="http://a.espncdn.com/photo/2008/1001/pg2_g_rodriguez_jeter_300.jpg" medium="image">
			<media:title type="html">Via ESPN.com</media:title>
		</media:content>
	</item>
		<item>
		<title>“Science these days has basically turned into a data-management problem&#8221;</title>
		<link>http://billpetti.com/2009/10/14/%e2%80%9cscience-these-days-has-basically-turned-into-a-data-management-problem/</link>
		<comments>http://billpetti.com/2009/10/14/%e2%80%9cscience-these-days-has-basically-turned-into-a-data-management-problem/#comments</comments>
		<pubDate>Wed, 14 Oct 2009 11:17:59 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data visualization]]></category>

		<guid isPermaLink="false">http://billpetti.com/?p=786</guid>
		<description><![CDATA[So says Professor Jimmy Lin at the University of Maryland in a recent NYT Technology article about the shortfall in &#8220;Big Data-competent&#8221; university students.  The article points out that the kind of data we are now dealing with (which will only continue to increase exponentially) requires a different perspective and experience than most currently have.  [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=786&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>So says Professor Jimmy Lin at the University of Maryland in a recent <a href="heading to NYC, listening to John Mayer Trio's &quot;Another Kind of Green&quot;" target="_blank">NYT Technology article</a> about the shortfall in &#8220;Big Data-competent&#8221; university students.  The article points out that the kind of data we are now dealing with (which will only continue to increase exponentially) requires a different perspective and experience than most currently have.  Firms that have a vested interest in workers with these skills, such as Google and I.B.M., have partnered with universities in an effort to change the frame of reference for students.</p>
<p>This underscores the <a href="http://billpetti.com/2009/08/07/profiting-from-an-analytically-driven-world/" target="_blank">comparative</a> <a href="http://billpetti.com/2009/08/14/more-on-a-data-driven-world/" target="_blank">advantage</a> of <a href="http://billpetti.com/2009/08/22/challenges-of-consuming-real-time-data/" target="_blank">individuals</a> with <a href="http://billpetti.com/2009/09/11/the-soft-sciences-to-get-their-day/" target="_blank">skills</a> amenable to collecting, coding, manipulating, and visualizing Big Data in the current labor market.  Additionally, as not all data will be easily collected and coded via computer programs, <a href="http://billpetti.com/2009/09/16/crowdflower-live-from-techcrunch50/" target="_blank">services</a> that can efficiently harness <a href="http://www.google.com/url?q=https://www.mturk.com/mturk/welcome&amp;ei=w2rUSsH_GNPd8Qa0kPSHDQ&amp;sa=X&amp;oi=spellmeleon_result&amp;resnum=1&amp;ct=result&amp;ved=0CAkQhgIwAA&amp;usg=AFQjCNEX45pwKPZxFM7TogCuzTVWBdmEDg" target="_blank">the crowd</a> in support of Big Data will also be critically important.</p>
<p style="text-align:center;">
<div class="wp-caption aligncenter" style="width: 388px"><a href="http://www.wired.com/science/discoveries/magazine/16-07/pb_visualizing"><img class=" " src="http://www.wired.com/images/article/magazine/1607/pb_visualizing_f.jpg" alt="A visualization of thousands of Wikipedia edits that were made by a single software bot. Each color corresponds to a different page." width="378" height="352" /></a><p class="wp-caption-text">A visualization of thousands of Wikipedia edits that were made by a single software bot. Each color corresponds to a different page.</p></div>
<p><em>Photo via <a href="http://www.wired.com/science/discoveries/magazine/16-07/pb_visualizing" target="_blank">Wired</a></em></p>
<br /> Tagged: analytics, Big Data, crowdsourcing, data, data visualization <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/786/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/786/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/786/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/786/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/786/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/786/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/786/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/786/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/786/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/786/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/786/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/786/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/786/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/786/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=786&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2009/10/14/%e2%80%9cscience-these-days-has-basically-turned-into-a-data-management-problem/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>

		<media:content url="http://www.wired.com/images/article/magazine/1607/pb_visualizing_f.jpg" medium="image">
			<media:title type="html">A visualization of thousands of Wikipedia edits that were made by a single software bot. Each color corresponds to a different page.</media:title>
		</media:content>
	</item>
		<item>
		<title>Challenges of Consuming Real-time Data</title>
		<link>http://billpetti.com/2009/08/22/challenges-of-consuming-real-time-data/</link>
		<comments>http://billpetti.com/2009/08/22/challenges-of-consuming-real-time-data/#comments</comments>
		<pubDate>Sat, 22 Aug 2009 12:22:42 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data]]></category>

		<guid isPermaLink="false">http://billpetti.wordpress.com/?p=351</guid>
		<description><![CDATA[I&#8217;ve run across quite a few stories lately discussing the 1) the revolution in data production we are living through and 2) the challenges we face in being able to sift through and view that data in a meaningful way through the web. The first comes from GigaOM, where Jennifer Martinez looks at the emerging [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=351&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve run across quite a few stories lately discussing the 1) the revolution in data production we are living through and 2) the challenges we face in being able to sift through and view that data in a meaningful way through the web.</p>
<p>The first comes from GigaOM, where Jennifer Martinez <a href="http://gigaom.com/2009/08/19/the-real-time-web-sifting-required/" target="_blank">looks at the emerging problem of trying to keep up with the constant flow of data</a> via status updates.  As our networks grow, and our use of various social networks increases, we are inundated with updates which often times leads to missing particular updates that we may be most interested in.  Additional, she notes that besides missing out on information you care about, this stream overload can lead to &#8220;<a href="http://gigaom.com/2009/08/13/the-evolution-of-blogging/">disjointed conversations</a> that <a href="http://gigaom.com/2008/11/28/with-twitter-a-desperate-need-for-context/">lack context</a>, making it hard to piece together and decipher what it all means&#8221;.  I can relate to this problem, and my &#8216;immersion&#8217; in social networks is average to above average.  I haven&#8217;t figured out an optimal way to keep up.  I try to utilize a few useful tools (e.g. Seesmic), but between social networks and Google Reader I find myself constantly playing catchup.</p>
<p>Michael Driscoll at Dataspora follows up on this theme providing a more high-level discussion of <a href="http://dataspora.com/blog/the-rise-of-the-data-web/">how the rise of data (vs. documents) conflicts with the architecture that underlies the web today</a>.  Current mark-up languages are geared towards, and ideal for, documents (e.g. HTML and XML), not the kind of streaming data that will come to dominate content.  To explain this point he provides a comparison of metaphors where documents=trees and data=streams:<br />
<span id="more-351"></span></p>
<blockquote><p>Trees are rooted and finite: you can’t chop up a tree and easily put it back together again (while XML has made concessions to <a href="http://www.w3.org/TR/xml-fragment">document fragments</a>, it is not a natural fit).</p>
<p>Streams can be split, sampled, and filtered. The divisibility of data streams lends itself to parallelism in a way that document trees do not. The stream paradigm conceives of data as extending infinitely forward in time. The Twitter data stream has no end: it ought have no end tag.</p>
<p>Conceiving of data as streams moves us out of the realm of static objects and into the <a href="http://mitpress.mit.edu/sicp/full-text/book/book-Z-H-24.html#%_sec_3.5">realm of signal processing</a>.  This is the domain of the living: where the web is not an archive but an organism.</p></blockquote>
<p>Finally, Ben Lorica at O&#8217;Reilly Radar discuss <a href="http://radar.oreilly.com/2009/08/big-data-and-real-time-structured-data-analytics.html">the challenges with trying to analyze large amounts of data in near real-time</a>.  As there are a number of potential solutions for the structured data that we are generating, there is a less obvious way to deal with the immense unstructured data.  He notes recent work by a team at UC Berkeley that was able to take unstructured data and, leveraging <a href="http://en.wikipedia.org/wiki/Named_entity_recognition">entity extraction</a>, turned it into structured data for a SQL database.</p>
<br /> Tagged: analytics, Big Data, data <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/351/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/351/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/351/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/351/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/351/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/351/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/351/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/351/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/351/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/351/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/351/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/351/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/351/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/351/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=351&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2009/08/22/challenges-of-consuming-real-time-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>
	</item>
		<item>
		<title>Personal Analytics</title>
		<link>http://billpetti.com/2009/08/17/personal-analytics/</link>
		<comments>http://billpetti.com/2009/08/17/personal-analytics/#comments</comments>
		<pubDate>Mon, 17 Aug 2009 10:10:21 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[personal analytics]]></category>

		<guid isPermaLink="false">http://billpetti.wordpress.com/?p=225</guid>
		<description><![CDATA[For those that are interested the &#8220;living by numbers&#8221; and personal metrics trend, be sure to check out your.flowingdata. Your.flowingdata is a great interface that allows you to track your daily behavior through quick, direct messages on twitter.  Want to track how far you walk, how much you eat, how often you spend online versus [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=225&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p>For those that are interested the &#8220;<a href="http://www.wired.com/medtech/health/magazine/17-07/lbnp_nutrition" target="_blank">living</a> <a href="http://www.wired.com/medtech/health/magazine/17-07/lbnp_exercise" target="_blank">by</a> <a href="http://www.wired.com/medtech/health/magazine/17-07/lbnp_health" target="_blank">numbers</a>&#8221; and <a href="http://www.wired.com/medtech/health/magazine/17-07/lbnp_knowthyself" target="_blank">personal metrics</a> trend, be sure to check out <a href="http://your.flowingdata.com/" target="_blank">your.flowingdata</a>.</p>
<p>Your.flowingdata is a great interface that allows you to track your daily behavior through quick, direct messages on twitter.  Want to track how far you walk, how much you eat, how often you spend online versus reading printed media, etc?  The program allows you to interact with the data in a number of helpful ways and, if you like, share your data with friends or download the data and <a href="http://flowingdata.com/wp-content/uploads/yapb_cache/picture_2.8y3glr5fxa0w8wsww80g48w80.9xgbnoosx68s4ok4sw448cwc4.th.jpeg"><img class="alignleft" src="http://flowingdata.com/wp-content/uploads/yapb_cache/picture_2.8y3glr5fxa0w8wsww80g48w80.9xgbnoosx68s4ok4sw448cwc4.th.jpeg" alt="" width="280" height="220" /></a>conduct your own analysis.  Nathan (the creator of FlowingData.com and, of course, your.flowingdata) <a href="http://flowingdata.com/2009/08/07/your-flowingdata-update-share-data-and-set-reminders/" target="_blank">has a post detailing several updates to  the program since the July launch</a>.</p>
<p>Like any data analysis exercise, the payoff can be uncovering patterns in your behavior that you simply weren&#8217;t aware of since we a) tend to lose perspective in the moment,  and b) we also our memories tend to be biased (i.e. recalling more recent events prominently, or emphasizing extreme events over mundane ones).</p>
<p>The key is that the program also provides various visualization tools so that you can very easily view the patterns as they emerge.  This kind of feedback is critical if you are serious about altering your behavior.  Maybe you eat too much, too late at night, aren&#8217;t getting enough sleep, spending too much time online.  In some cases you likely have an idea that your behavior is out of whack, but in other cases you may not realize the impact that certain behaviors are having.  Visualizing your personal metrics can be a real eye-opener and help you to make productive changes in your life.</p>
<br /> Tagged: analytics, data, data visualization, personal analytics <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/225/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/225/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=225&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2009/08/17/personal-analytics/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>

		<media:content url="http://flowingdata.com/wp-content/uploads/yapb_cache/picture_2.8y3glr5fxa0w8wsww80g48w80.9xgbnoosx68s4ok4sw448cwc4.th.jpeg" medium="image" />
	</item>
		<item>
		<title>Linkage for Sunday</title>
		<link>http://billpetti.com/2009/08/16/linkage-free-analytics/</link>
		<comments>http://billpetti.com/2009/08/16/linkage-free-analytics/#comments</comments>
		<pubDate>Sun, 16 Aug 2009 18:49:02 +0000</pubDate>
		<dc:creator>Bill Petti</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[data visualization]]></category>

		<guid isPermaLink="false">http://billpetti.wordpress.com/?p=205</guid>
		<description><![CDATA[Dan at Predictably Irrational further discusses the results of his FREE! experiment.  Basically, the simple fact that something is made available free creates a preference for that good even when the relative difference in price between that good and a presumably better one stays constant.  However Dan recounts the results of some different variations on [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=205&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<ul>
<li>Dan at Predictably Irrational <a href="http://www.predictablyirrational.com/?p=653" target="_blank">further discusses the results of his FREE! experiment</a>.  Basically, the simple fact that something is made available free creates a preference for that good even when the relative difference in price between that good and a presumably better one stays constant.  However Dan recounts the results of some different variations on the experiment and notes that the change in demand was not constant</li>
<li>USA Today <a href="http://www.usatoday.com/news/washington/presidential-approval-tracker.htm" target="_blank">has launched a Presidential approval ratings tracker</a>.  The tracker provides data from the Gallop Presidential Approval poll starting with Truman in the 1940&#8242;s.  You can view the previous 12 presidents in chronological order, isolate specific presidents, or compare them.  The tool provides the exact dates from the polls and the breakdown of approve, disapprove, and no opinion.  Via <a href="http://flowingdata.com/2009/08/05/track-presidential-approval-ratings-and-compare-to-past/" target="_blank">FlowingData.com</a>.</li>
<li><a href="http://blog.jonudell.net/2009/07/31/polymath-equals-user-innovatio/" target="_blank">Crowdsourcing answers</a> to the toughest problems in mathematics.</li>
</ul>
<br /> Tagged: analytics, crowdsourcing, data, data visualization <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/billpetti.wordpress.com/205/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/billpetti.wordpress.com/205/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/billpetti.wordpress.com/205/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/billpetti.wordpress.com/205/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/billpetti.wordpress.com/205/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/billpetti.wordpress.com/205/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/billpetti.wordpress.com/205/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/billpetti.wordpress.com/205/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/billpetti.wordpress.com/205/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/billpetti.wordpress.com/205/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/billpetti.wordpress.com/205/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/billpetti.wordpress.com/205/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/billpetti.wordpress.com/205/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/billpetti.wordpress.com/205/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=billpetti.com&amp;blog=8839193&amp;post=205&amp;subd=billpetti&amp;ref=&amp;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://billpetti.com/2009/08/16/linkage-free-analytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/83d0c69bc078d64ebe36a701cbf755b2?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96&#38;r=G" medium="image">
			<media:title type="html">billpetti</media:title>
		</media:content>
	</item>
	</channel>
</rss>
