10 Places for Market Researchers to Learn about Data Visualization

Data Visualization Infographic

Have you ever sat through an interminable PowerPoint presentation about market research data? How did it make you feel?

I thought so.

We all could stand to learn ways to present data in a parsimonious and beautiful way. After all, as researchers, we are in the business of communication about data.

I recommend you keep up on the dynamic world of data visualization.

Here are 10 links that will help you get started understanding different aspects of this fascinating discipline.

  1. Bitsybot is a really cool site by GfK Custom Research’s data wiz Bitsy Hansen.
  2. The good folks at O’Reilly Media have an extensive collection of data visualization resources.
  3. Information is Beautiful is by “data journalist” David McCandless. The name says it all.
  4. There is a LinkedIn Group dedicated to data visualization in market research called Market Research Data Visualization.
  5. I Love Charts presents humorous made-up charts which often manage to communicate real lessons about the presentation of data, whether intentionally or not.
  6. Chart Porn…’nuff said.
  7. ChartChannel from iCharts has interactive charts you can clip and use in your blog.
  8. Visual Complexity focuses on the representation of complex networks.
  9. Into maps as well as data? Strange Maps has you covered.
  10. Flowing Data has lots of great visualizations.

Enjoy!

Image Credit

Kristin Luck on Women in Research

WIRe LogoLiving in Los Angeles in 2007, Kristin Luck had a conversation over cocktails with a group of other women working in market research.

They quickly found they agreed on the need for women in the market research profession to support one another.

The result was a networking group for LA-area women in market research called WIRe (Women in Research).

Luck gives particular credit to Elaine Coleman, now of Resolve Market Research. Coleman was new to Los Angeles at the time and was interested in meeting other women in research.

The group started out as an informal cocktail hour but has slowly grown to host more formal events.

“The mission for the group,” Luck said, “is to encourage empowering and nurturing relationships among women in market research.”

The group does that through a combination of informal get-togethers, networking events, and a new mentoring program launching in 2012.

Since 2007, the organization’s presence and scope have grown. In 2011 WIRe grew significantly, launching a website, holding more frequent events, expanding beyond Los Angeles, and welcoming more members to its Facebook presence.

Today the group’s networking meetings are held quarterly in Los Angeles and New York, and their first event in London is scheduled for May 24th.

“It’s exciting to see how the group has evolved and grown over the last few years,” Luck said.

Luck continues to run the organization along with Cassandra Rowe, Senior Manager of Consumer Insights at Netflix.

Kristin Luck

Kristin Luck

After stints at Lieberman Research and ACNielsen, Luck co-founded a highly successful market research technology firm, OTX Research (now Ipsos OTX MediaCT) along with Shelley Zalis. Her second technology venture, Foresight Consulting Group, was acquired by Decipher in 2007, and she has led that firm’s growth as president since that time.

As Luck rose through the market research industry ranks, she experienced a change in her surroundings. She said that market research is an industry that’s traditionally been dominated by women, but those women tend to be in lower to middle management positions. Luck pointed out that looking at the Honomichl 50, the CEOs who are women are a small percentage as compared to their proportion in the industry as a whole.

“I think it’s really important,” said Luck, “if you’re a woman in a senior management position – we’re all busy and we all have a lot going on – but it takes so little time just to give back to somebody and help them make that next step or give them advice.”

Luck says she is often asked about inclusion of men in the group. “We haven’t excluded men from the discussion or from any of our events. Even though our focus is on empowering and nurturing women in research, we’re a man-friendly group. I think it’s really important that men are part of the dialogue about women in research.”

WIRe events are purposefully kept informal, with no speakers and no sales pitches. Only recently has WIRe brought on sponsors. Luck cited ESOMAR as a key supporter.

The most formal presentation in recent meetings has been a screening of a film called “Miss Representation,” a documentary about how women are portrayed in the media, business and politics.

Luck said one of the lines in the film really resonated with her: “You can’t be what you can’t see.”

“It’s tough as a woman to understand what your career path is,” Luck said, “and how to get into a senior management position if you don’t see women in those roles, and you’re not being actively mentored and nurtured by other women that are in those positions.”

Asked what she would say to up-and-coming women in market research, Luck said she would advise “paying attention to your personal brand and getting passionate about something and figuring out what you love about the industry.  I think that you naturally excel at things that you love doing. You really have to stay focused on keep your eye on what the next step is and it’s important not to get pigeonholed in any one place. There comes a time where you have to leave and do the next thing.”

You can find WIRe at womeninresearch.com and you can follow them on Twitter with the handle @womeninresearch.  

What Market Researchers Need to Know About SoLoMo

Do you know everything you need to know about SoLoMo, the convergence of the social, the local and the mobile?

Could you learn something from Charlie Rader of Procter & Gamble, Steve Rappaport of ARF (and author of Listen First!), and Andrew Jeavons of Survey Analytics?

I thought so!

Well, it’s not too late to sign up for today’s webinar, “SoLoMo: How Social Media, Localization, & Mobile are Redefining Marketing Insights” at 1pm Eastern / 10am Pacific, featuring Charlie, Steve and Andrew and moderated by GreenBook‘s Lenny Murphy.

This session is the second in a series of webinars about big ideas in market research brought to you by Research Access and GreenBook.

Research Access - GreenBook

If you are not able to attend live, sign up anyway and we will send you a link to the video of the webinar and a copy of the slides.

Don’t forget to tweet your questions to the hashtag #mrxideas.

Click this link to sign up for the webinar.

#mrxideas

An Interview with Snapdeal CEO Kunal Bahl

Kunal Bahl

I recently had the honor and the pleasure of interviewing Kunal Bahl, the CEO of Snapdeal.com, perhaps the hottest c-commerce company in India.  If you are not familiar with the Indian e-commerce market, I recommend this recent article from Forbes magazine, “Startup India Flips Out over E-Commerce,” which mentions Snapdeal prominently.  We touched on a wide range of subjects, and of course, I asked him about how he uses data in his business.

Dana Stanley:  For those readers who may not be familiar, please tell me about your company.

Kunal Bahl:  Snapdeal.com is one of the fastest growing e-commerce companies in India. Launched in February 2010, the company is the largest online marketplace for Products and Services, featuring a wide range of products and services across an array of categories from thousands of national, international and regional brands. The company is 1000+ people strong and is the largest e-retailer of local merchant deals, watches, sunglasses, jewellery, perfumes, personal care, among other categories, delivering to 5000+ cities and towns in India. 1 out of every 7 internet users in India is subscribed on Snapdeal, and the company is adding over a million new subscribers per month.

Dana Stanley:  That’s amazing.  How is SnapDeal different from and how is it similar to companies like Groupon and LivingSocial?

Kunal Bahl:  Snapdeal.com started as a local deals site, featuring businesses across categories like Dining, Health & Beauty, Entertainment among others. Various innovations like part payment model, eliminating minimum buyer cap for a deal to go live, mobile vouchers etc. were introduced to customize the platform for Indian audience. Over the course of last 2 years, Snapdeal also introduced Products in addition to the services deals, and has emerged as the largest market place for brands and businesses in India.

The powerful model of local merchant offers along with physical product e-commerce is something which is very unique to Snapdeal.com and it gives the opportunity to provide wider variety of choice to the customers. Through the use of smart recommendation engines, we also cross sell services deals with products which is a very powerful combination.  This controlled Products and Services marketplace model, keeps the business lean and also enables rapid expansion of assortment while delivering a great customer experience, given the higher relevance due to greater choice.

SnapDeal LogoDana Stanley:  What are the challenges and rewards of operating an e-commerce business in India?

Kunal Bahl:  Challenges are predominantly in the area of building infrastructure to deliver a superior customer experience. Right from reliable payment gateways to efficient supply chain partners, there is a lot of scope for improvement. More and more customers will start transacting online, as and when these challenges are addressed.

With respect to the rewards, there is a huge potential for the growth of e-commerce in India. Compared to 35% internet transactor penetration out of all Internet users in China, the number is just 5% in India. Also, in spite of accounting for 17% of the world population, only 4% of the Internet users account from India. Given, that innovative models like Snapdeal, within the e-commerce space can scale very fast, it is a great opportunity. In addition to the numerous brands, there are more than 17 million Small and Medium enterprises, providing goods and services to the end consumers, only in the Offline space. E-Commerce as a platform can bridge the gap between these businesses and end consumers.

Dana Stanley:  In what ways does data guide your business?

Kunal Bahl:  We use technology and strong analytics to drive pretty much every decision in the company. Using analytics, we are able to build and optimize strong recommendation engines that learn and suggest what users want next. We use world class technology and data driven tools to monitor and improve various spends and performance of Marketing, Customer care, Sales force operations and a host of backend operations like Supply chain management and Website performance.

Dana Stanley:  How do you see the e-commerce market in India evolving?

Kunal Bahl:  The internet industry is currently one of the fastest growing sectors in India and has huge potential given that, there is a rapid increase in not only internet penetration but also people who have access to credit cards, and disposable income to spend online.
Out of the 100 million Internet users currently, a minute percentage buys goods and services online. Within the next 2-3 years, a significant chunk of this audience will transact online. There is a lot of scope in enabling e-commerce through Mobile as well, given the numbers are significantly higher for the same.

Dana Stanley: What’s next for you and for SnapDeal?

Kunal Bahl:  We will continue to grow our footprint across India. In addition to geographic expansion, Snapdeal will continue to increase the assortment of products across various categories. Our objective is to build a robust platform which will be the biggest marketplace for products and services. Brands and Small and Medium enterprises alike, who have products and services, targeted towards end users, will be able to reach out to a very targeted audience through our platform. For consumers, it will be a great platform for discovery of goods and services not only within the city, but across the country.

Fewer Than Half Know How to Protect Privacy Online

Online PrivacyThe good folks at the Pew Internet & American Life Project have done it again.

They have the substantial mandate to chronicle the use of the internet and technology among Americans.  As part of that responsibility they publish regular reports which do a good job of measuring various technology related phenomena – some of which is less than exciting, but they provide a good service by putting numbers to things about which others merely guess.

However, it seems that in every report they release, there is at least one nugget of information which surprises, provokes thought, and even inspires.

Their latest report, Search Engine Use 2012, by project director Lee Rainie, Kristen Purcell and Joanna Brenner, contains just such a nugget.

They asked internet users the following question:

Are you aware of any ways Internet users like yourself can limit how much personal information websites collect about you, or are you not aware of any ways to do this?

Only 38% said they knew how to limit the personal information collected by websites!

Pew Internet Privacy Chart

Think about that for a moment.  Three of five American internet users don’t even know what to do to protect their privacy, even if they have privacy concerns.

Further, only 75% of those who use any method of privacy protection whatsoever have actually used the privacy settings of the websites they visit.

This data is a sobering reminder that technology providers, marketers and market researchers all have a responsibility to educate the consumers with whom they interact, be they customers, prospects or survey respondents.

To be sure, if internet users are educated about privacy issues, they will be more likely to seek privacy solutions.  At a minimum, however, we need to make it very clear to them how they can choose various privacy levels.

How do you approach privacy when working with respondents or customers? Share your thoughts in the comments section below.

Market Research is Beating Marketing Research

Market Research or Marketing ResearchIn a recent post I posed a question: “Which term do you prefer: market research or marketing research?” In it I gave readers a chance to vote on which term they prefer, or to choose “no preference.”

As of this writing, with 27 votes, “market research” is beating “marketing research” 56% to 30%, with 15% choosing “no preference.”  Full disclosure: I cast my vote for “market research.”

This is all well and good, but I’d like to see some more results before coming to any conclusion.

Please “weigh in” (cue rimshot) on the “market research” vs. “marketing research” MicroPoll, which I’ve re-embedded below, as well as in the comments section at the end of the post.

Start Learning to Become a Data Scientist

Data Scientist

With the advent of the web and increased business process automation, companies are increasingly finding themselves inundated with vast troves of data.  Making sense of and extracting insights from this data is a job that requires a wide range of skills – from the technical to the inferential – which are seldom found together in a single professional.  This is the job of the Data Scientist – a job that most companies of significant size will need to fill in the coming years.

The following is a  very insightful exchange about the Data Scientist role from a recent webinar sponsored by Research Access and GreenBook on the subject of Big Data.  Four expert panelists contributed to the discussion:

I hope you enjoy this interesting and insightful exchange as much as I did.

Steve Cohen: Just as “Big Data” has been thrown around, there’s a new term thrown around called the “data scientist.”  So the question is I’ve seen it kinda pop up in a lot of blogs and discussions and so on about what a data scientist is.  Again, the best definition of a data scientist I’ve seen, it’s somebody who has really three skills.  The three skills are statistical skills – that is, you can model data and extract value from a model.  The second one is somebody who has hacking skills.  I don’t mean hacking in the bad sense of the word but hacking in terms of somebody who could be presented with a data set and, no matter what form it’s in, can pull it into a form that you can do modeling on.  But the third and most important part is somebody who understands the subject matter and can extract the value from the information.  That’s a very rare person.  So if there are any people listening right now who have those three skills, you’re gonna be quite valuable over the next 10 to 20 years or so because there aren’t many of you out there; trust me.

Romi Mahajan: Update your resume and get on LinkedIn if you want a job.

Steve Cohen: Don’t worry.

Lenny Murphy: We recently predicted that there’s a gap of something like 100,000 data scientists needed.  I’m not sure if they used that definition that you used, Steve, but I think it’s a good one.  And it’s a real issue.  Let’s talk about that for a second right now.  I think as technology is evolving and the vision of being able to deliver Big Data from an etiological standpoint is coming to fruition, there’s a human capital gap that I think may become fairly critical in pretty short order.

What the hell do you do with this?  What does it really mean?  How do we extract real value and deliver real action, as Charlie said, from this?  That’s going to require a real retooling both from an educational standpoint, from an HR standpoint, from a policy standpoint in how organizations look at and identify the whole structure to support this type of model.

Steve Cohen:  Let me react to that quickly.  I just wanna say I’ve already had two business school professors I know contact me after I wrote that information you just talked about, Lenny, who both said to me, please talk to us about how should we be retooling our undergraduate and graduate programs in order to meet the needs of data scientists in the future.  So it’s starting to get on the radar screen of the universities.

Lenny Murphy:  That’s good, because they don’t do well with market research.  So it’s glad they’re coming on board with that.  They’re still teaching stuff from 50 years ago on the MR industry.  I’m really glad to hear that, Steve.

Charlie Wardell:  Lenny, in the Big Data space and in the opportunity that we have right now with the data scientists, let’s not forget that it’s a slightly different space than the Business Intelligence of old.  If you look at the data warehouses just over the last 10 or 15 years, I think it’s a real scary word to a lot of CIOs, because 50% to 70% of the data warehouses or enterprise data warehouse initiatives fail.  When you look at the reasons for it, it’s obvious why, but no one’s really cracked the code in what makes a successful data warehouse.

Now we’re amping up the problem domain.  We’re dealing with enormous amounts of data, and we need to process it very quickly, but the insights that are gleaning is really at the scientific level.  These social scientists and cognitive psychologists are looking at the data, these big volumes of data, and then the relational side of the data.  So, yeah, it’s a huge market for these types of data scientists, but we just need to make sure that it doesn’t become the next Business Intelligence of Big Data.  We’ve really got to drive usable insights.  One of the real reasons why the data warehouse fell is lack of adoption.  Nobody trusts the data.  Nobody’s using the platform.  Nobody’s using the insight.  So not only do we have to glean the insights, we have to convince other people that the insights that we’ve gleaned are real and accurate and actionable.

“Big Data” – Peril or Opportunity?

Peril OpportunityOn February 21st I moderated the inaugural webinar in a series of sessions about big ideas in market research.  The series is being co-produced and sponsored by Research Access and GreenBook

The first webinar was on the topic of “Big Data.” We assembled an expert panel for this event, including:

I asked the panelists whether they thought Big Data is a peril or an opportunity.

Dana Stanley:  Is Big Data more a peril or more an opportunity?

Romi Majahan:  Let me go ahead and take a crack at that.  In my definition of Big Data, I sort of talked about both the peril and the opportunity.  So let me start with the opportunity.  I mean Big Data allows you to find patterns and find wisdom in a set of information.  So whether it comes to figuring out how diseases spread or how to attack specific germs or viruses that sort of plague humanity, whether it’s influenza or whatever, whether it comes to trying to figure out how to really gauge the attitudes consumers have about a specific product, there’s incredible opportunity that can be found in the emerging patterns in Big Data.  Similarly, the other face of that coin is that the peril is that there is so much data, that there are so many ways to interpret it, there is this issue of velocity and vastness.  There’s a problem of misinterpretation or false interpretations of data.  So big data allows us in some ways to both sort of capitalize on the opportunity but also sort of succumbs to the perils as well.  I think it’s incredibly important, therefore, to have both a programmatic way of dealing with big data but also a somewhat scientific knowledge of what makes Big Data sort of new and different than sort of the streams of data that we used to have.  So I believe there’s a peril and an opportunity, which mimics the Chinese character, of course, for opportunity, which has peril on one side and opportunity on the other side.

Steve Cohen:  I’d like to take that, also.  I mean I think that a good example is that you can look at, let’s say, how sales are doing over time for a particular product and you can look at those sales by geography or look at it by who the retailer is and so on.  But if you’re starting to look at why does a particular skew, let’s say, the small skew versus the big skew, how do sales of that particular skew look in a rural area that is serviced by a particular retailer in a particular time of the year or the holiday season or so on.  Now you’re going down and you’re finding some very micro kinds of analyses that you can do drill-downs on very easily.  The problem is, if you now want to start making generalizations about what works when and where and how, you have to start modeling that data and extracting value from it.  So the Big Data is to be able to look at a multidimensional array of potential predictors and factors that could cause sales to go up or to go down.  As I say, you can drill down as far as you want on any of these kinds of things with a big dataset.  The issue is to be able to make generalizations that you can use to generate policy for your company and directions in which you should go by doing analytics on them.  So the – where we come from is that you’ve got to not only have the data but you also have the ability to summarize and make generalizations by using analytics on them.

Charlie Wardell:  I mean we haven’t even scratched the surface of the scratch yet with the opportunity of data, the insights that can be gleaned out of data.  I think people want more data, not less.  So the opportunity is slated for $1.5 billion in 2012 for Big Data opportunities as far as the tool vendors.  So this is the year where there there’s gold in those hills and everybody’s rushing to it.  That’s the Big Data paradigm right now.  The peril, or the things to look out for, is, ironically, information overload, analysis paralysis by data.  We really need some real good insights, real good analytics.  And the challenge is that how do we do that in a fast and efficient way, right?  If you have the ability to analyze data very quickly, you have – and recursively and you’re able to analyze data over and over again within the same period of time, you can get really insightful analytics.  That’s typically what needs to happen when you’re working with large volumes of data.  The real-time nature is something that I’m really strong on, but the one thing that could be the peril is that you have a whole bunch of people out there, they’re drinking the Kool-Aid of Big Data in an architectural and methodology approach, which is a batch-based, monolithic, top-down architecture where you’re dealing with large volumes of data and, in order to get scale and redundancy, you have literally thousands of servers.  And then they’re gonna find out, well, what do you do if I need real-time access to that data or if I want to act on streaming data.  The whole thing about data is not only the insights and the analytics but it’s the end result, which is action.  What do I do?  And the people who are able to make that action quicker than others are going to seize on the opportunity.

Romi Mahajan:  I want to just pop in one thing because, quoting Alexander Pope about a little learning being a dangerous thing, when I think about Big Data and when I look on the peril side, without knowledge of how to interpret it, without knowledge of where data comes from and how to analyze it and what conclusions to draw from it, we’re creating more surface area to get it wrong as well.  You can take data, you can make some inferences from it, but you can also get it wrong because there’s so much data.  So I really appeal to people to not simply bandy about the term “Big Data” but to really seek deep knowledge about data before they make plans based on what they believe is a correct interpretation of Big Data.

Lenny Murphy:  I would add onto that, but I think the peril really is to get so caught up in the idea that we have these advanced algorithms with so much data that it answers all the questions.  I don’t believe that we’re going to see that happen.  The context is always going to be important, as well as the business issue.  So from that standpoint, even though we can create the most advanced algorithms in the world to be able to make these relational connections between data sets and we can see the patterns, et cetera, et cetera, it’s still gonna require good, sound business acumen and training from an inside standpoint as well, I think, to be able to connect the dots and deliver real value from it.  I think if we get to the point where we’re just making decisions willy-nilly because an algorithm is telling us that that is the pattern that that could be certainly the greatest path towards the peril aspect here.

Romi Mahajan:  Agree.  Totally agree.

Note: A special “thank you” goes out to Focus Forward for transcribing the webinar.

“Big Data” Defined

BinaryOn February 21st I moderated the inaugural webinar in a series of sessions about big ideas in market research.  The series is being co-produced and sponsored by Research Access and GreenBook (look for announcement of upcoming installments in the series on Research Access and GreenBook Blog soon). 

The first webinar was on the topic of “Big Data,” a term which is quite a buzzword in the market research community these days. I find people are generally pretty confused about what Big Data is and what tools are available for analyzing it.

We assembled an expert panel for this event, including:

I started the webinar by asking each of the expert panelists for their definition of Big Data.

Dana Stanley:  I’m going to ask each of our expert panelists a simple question. What is “Big Data?”

Steve Cohen: Big data is an interesting question. I’ve heard several definitions.  The first definition is what we call the three-V definition, which is the variety of data that you may get, is one of the Vs.  So that could be coming from social media.  It could be coming as clickstream information.  Could be coming from TV zapping and TV remote control information.  It could be coming even from the CERN Large Hadron Collider in Switzerland.  The second V is velocity, how quickly it comes, and the third is the volume of information.  That’s probably the most common definition.

If I can go to a second definition that I happen to like, it’s what I call the VAST definition, V-A-S-T, which stands for Variable Attributes Subjects, or people, and Time, where one or more of those are in the thousands, the tens of thousands, or even the millions.

But if I could give one more definition, this definition comes from the Berkeley AMP lab, where AMP stands for – A is Algorithms, Machines, and People.  And they basically say Big Data is any data set you have where the data is expensive to manage and hard to extract value from.  So you don’t have to have something like the CERN Large Hadron Collider which generates a petabyte of data every second. You don’t have to have a petabyte of data every second to have big data.  I’ll stop there and let somebody else chime in.

Romi Mahajan: I tend to think, as a marketer, in terms of metaphors.  When I think of Big Data, I think of the fabled Roman god Janus, who had the face of both the creator and the destroyer.  He was a two-faced god.  The creator; when I think about big data, I think about huge sets of data from which you can extract intelligence, you can make meaning, you can find wisdom.  And the destroyer; these tracts of data are so vast, so huge, and so impenetrable it might seem to the naked eye, that we can get caught into the analysis paralysis that comes as a function of having simply too much information to deal with.  So for me, Big Data is an opportunity for wisdom-making, but it’s also potentially a peril in terms of analysis paralysis.  So that’s the metaphor that I use to operate my view of Big Data.

Dana Stanley:  Lenny, how do you define Big Data?

Leonard Murphy: It’s hard to follow up on Romi and Steve there, Dana, so maybe I would turn to a slightly different context and say that big data is the future of how enterprises will be able to more effectively deliver information internally and value to customers.  That’s certainly the business context and, as we look at the definitions of having massive, massive data sets available via social media, via CRM, via the passive applications with mobile and point of sale information, et cetera, Big Data is the process whereby we aggregate that information, extract information out of it, and look at value to clients and to consumers.

Charlie Wardell: So for me, Big Data is a pretty simple term.  It’s an overused term.  There are about two and a half quintillion bytes of data being generated daily.  If you look at the real hardcore definition of what big data is, you’re probably looking at petabytes to exabytes of data.  But I’m a more practical kind of guy.  I think it’s anything south.  I’ve had clients that had big data problems if that were less than a terabyte.  It really depends on what your capabilities are and what your need is.  There is an aspect of Big Data that is not really being addressed – it was touched on earlier – which is the velocity of the data, the speed at which it comes in but, moreover, the speed at which you can process that data.  So there’s a whole new angle to Big Data that is starting to emerge that’s very important.  Most of the Big Data solutions out there attempt to accomplish your analytics and your insights through batch-based processing.

If you think about it, that is good to a point, but there is a need for real time because the Web is real time and insights need to be real time.  So there’s a new aspect of Big Data which I believe strongly is related to looking at the data in real time.  But for me, big data, in a practical sense, is anything that’s not manageable by traditional technology, relational databases like Oracle or SQL Server or MySQL.  It could be a variety of data from pretext to structured text to binary data to YouTube videos.  As long as it has bits and bytes, it’s data.  And if it’s not able to be handled in a conventional means in a real-time fashion, for me it falls into a Big Data category.

Note: A special “thank you” goes out to Focus Forward for transcribing the webinar.

A Discussion of Text Analytics with Michael Tupanjanin

Michael TupanjaninWhat follows is the next in a series of interviews I conducted at the Net Promoter Conference in San Francisco last month.  If you missed my video interview with Dr. Ming Duong-van, you’re going to want to click over for a listen to his fascinating interview.  Still to come is an interview with Satmetrix CEO Richard Owen.  This interview is with Michael Tupanjanin, the CEO of Metavana.  The interview was conducted in the morning on the day Metavana and Satmetrix announced a partnership to create a social Net Promoter Score called the SparkScore.

Dana Stanley: We’re here at the Net Promoter Conference at San Francisco with Michael Tupanjanin, CEO of Metavana, as well as the company’s CMO, Romi Mahajan.

Michael, why don’t you go ahead and tell people who aren’t familiar with Metavana a little bit about your company.

Michael Tupanjanin: Sure, so Metavana was started about three and a half years ago by a guy named Ming Duong-van.  Dr. Ming is very well known in the academic circles primarily as a physicist. He was actually the co-founder of chaos theory. And he’s spent a lot of time studying the text analytics market and has, I think, done some incredible breakthroughs, scientific breakthroughs, specifically the algorithms that he’s written for Metavana that really take a look at text, specifically in the social web, and really uncover the true meaning and opinions that people have on the social web.

Dana Stanley: So when you’re talking about the social web and text, give me a practical sense of what type of data your software’s analyzing.

Michael Tupanjanin:  Well, I think just about every piece of text as far as I know is unstructured on the social web, which can be incredibly chaotic. So if you think about the correlation of people that have studied chaos theory and the clusters of galaxies, you’re actually able to apply that scientific principle to the social web, where the conversations are unstructured, the sentence and the grammatical structures are completely wacky, and the content itself is very unstructured. Being able to actually get meaning out of the second structures is a very difficult thing to do.

Dana Stanley:  What are some examples of how folks are using the Metavana technology to gain insights?

Michael Tupanjanin: We have a couple customers, like Marriott, they have a customer service group that spends a lot of time looking at the social web analyzing things like the basic things, what was your stay like at our hotel? Were the beds OK? Were the towels OK? Was the room service OK? And they’re always analyzing those pieces of information to see how they could improve their service.

We have another company that’s using our technology for smartphones. So right now, the smartphone market is incredibly competitive. We have a clear leader in iPhone, and they’re trying to figure out what their competitive advantage is. What kind of things can they put into their product to make them better? They’re also looking at customer service issues.

Dana Stanley: What do you say to people who throw out the idea that not all sentiment is on the web, that the people who participate in the web, that’s just a segment of all that sentiment that people need to pay attention to.

Michael Tupanjanin: That’s a good question. I’m a neophyte in market research. But here’s my impression. Market research is actually somewhat limited in terms of the sample size, right? You send out a survey to a bunch of people, but the sample size of the social web’s a lot larger than the sample size that you send out to people through your surveys themselves. And I think there’s also a predisposition amongst people that actually are willing to fill out a survey, as opposed to people that are just expressing their opinions on the web where it’s a little less stilted, and you actually probably get more meaningful information back.

Romi Mahajan: Dana, can I just pop in on that?

Dana Stanley: Absolutely.

Romi Mahajan: I think it’s a very prescient questions about how big, how complete is your set, right? And clearly, the social web is not everything, but there are 845 million people on Facebook. There are 250 million, bordering on now 270 million tweets a day. And each of these expresses something. Now, not all of them express sentiment, but a lot do. I think where normal, canonical market research needs to grow and evolve is in the notion of active data collection versus passive data collection, where what people are expressing on the social web is– they’re expressing it while in the context, their natural context.

They’re not being prompted. And so you get a different set of data, right? You get maybe a more natural set, a more authentic set, but a different set. In reality, when you put these two sets together, you get the truth. But the fact that structured data is easier to come by and unstructured data is harder to decipher, that’s what gives a company like Metavana room to maneuver.

Dana Stanley: Where do you think companies are in terms of their approach to this? Are companies diving into sentiment analysis? Are they wary? How would you assess that?

Michael Tupanjanin: I think that the market, in general, is incredibly interested. And I’ll take it to a higher level called text analytics as opposed to sentiment analysis.

Dana Stanley: Sure.

Michael Tupanjanin: I think the market’s incredibly confused. I think the market’s incredibly chaotic right now. There are lots of solutions that are available in the market. And I think a lot of the solutions are incredibly complex to actually do implementations to. So traditionally, a lot of those sentiment analysis or text analytics seem to reside with the knowledge management people inside major companies. And I think there’s a huge opportunity to actually now take it out to the masses, to the functional leaders, the sales leaders, the marketing leaders, the product management leaders, the research leaders, where they really haven’t had access to this kind of technology before.

I think there’s a lot of latent demand for it, but there’s also a confusion because I think so many different companies are approaching it in so many different ways. And I think traditionally the accuracy levels have not been that great. So I think there’s a little bit of skepticism, too.

Dana Stanley: So help me understand Metavana’s unique approach.

Michael Tupanjanin: So without getting into a long, scientific explanation – what it all comes down to is the algorithms that you write and how accurate they are and the principles that you apply. Traditionally, there’s been two approaches to what we’ll call text analytics. There’s been the natural language processing approach and then the more machine-learning approach.

The natural language processing, tends to be a very highly curated approach, like almost a lot of human intervention actually looking at grammatical structures and trying to develop taxonomies to be able to pull out the meaning, versus the statistical approach, which is much more automated and based specifically on algorithms themselves. Traditionally, people have felt that the statistical approach is less accurate, that the natural languaging process approach is more accurate.

However, the natural languaging approach tends to be not scalable because you have to spend a lot of time going through taxonomies versus having a more statistical approach, which is much more scalable. We tend to be more towards the statistical end, but the algorithms that we have written have taken accuracy to a whole new level, up to over 95%.

Romi Mahajan: Dana, it’s a great question. I think Michael answered it correctly on the scientific side. When we think about our business in general, right, we think about three core principles around why we think we’re unique. One is clearly accuracy, right? So whereas the industry is offering scarcely better than a coin toss accuracy, we’re offering one standard deviation away from perfect, so 95%, 96%. The second is what we call accessibility. We don’t believe that customer satisfaction understanding the social web should be sequestered or siloed someplace in the CSAT division of a company. It’s really for everyone.

So we’re building a system that allows any one in the corporation to be able to take– to interpret the social web. Accessibility is the next thing, true enterprise scale. And the third thing is scalability. We believe that our business model is going to offer the ability for anyone, regardless of price point, regardless of degree to which they believe in the social web or not, to access the social web. So those three principles we think make us unique.

Dana Stanley: That’s great. One thing that stood out, you mentioned the accuracy level. I’m just curious, how do you measure accuracy, or how do you self-evaluate as your algorithms presumably evolve?

Michael Tupanjanin: Yeah, we actually have to do it the old fashioned way. We literally will take– we recently did about 3,000 quotes that we actually rated, and we sat down with a bunch of high school kids and actually had them go through sentence by sentence by sentence and see, how would you score this sentence? And how did the machine score the sentence?

Dana Stanley: So you’re basically giving them homework?

Michael Tupanjanin: Absolutely.  There is no other way to do it because you can either do it some kind of automated way, which again, people question whether or not that’s the right way to do it.

Romi Mahajan: The thing is, once you go through the high school exercise, then the system learns on its own. But you have to go through the initial validation period to make sure that if someone leaves Starbucks and says, man, that Americana was awesome, that somebody’s verifying that that’s a positive comment.

Dana Stanley: Yeah, and how do you account for evolving language, and Urban Dictionary entries, and the fluid nature of language?

Michael Tupanjanin: Yeah, so the way the process is set up, we actually– one of our unique things is that we actually do things on a domain by domain basis. So we, for example, we’ll start with smartphones as a category. We’ll start with printers as a category, hotels, or airlines. And each of those domains has their own specific language in them. And one of the things that we do is the engine goes out actually crawls and trains itself on the language of that particular domain. So that’s one of the reasons that we get such high accuracy rates.

But the reality, as you said, is that language continues to evolve. And new words of slang appear all the time. So we found that we have to at least have the engine retrain itself every quarter. And it’s not a manual process. It’s literally simply going out and crawling the same data sources and doing almost like a QA process on the data sources for about a week, and then it’s updated itself on the slang. What it also does is it updates itself on categories. So what the engine does when it goes out and crawls, versus having a taxonomy that’s kind of predetermined, it actually will develop its own taxonomy based on organically what seems to be the right category.

So, for example, we crawled the airline industry, and lo and behold, the categories that came up were seating, crew, entertainment, waiting lines at the airport, baggage handling, all the things you would suspect. But at some point, there could be other categories that emerge.  For example, security, gate security, and stuff like that seems to be starting to percolate on the social web could become a category, too. So that’s part of the engine’s updating process.

Dana Stanley:  Do you sometimes get into arcane industries where maybe the client would have particular language that your incorporating as you go along?

Michael Tupanjanin: Some industries are more difficult than others. We’ve actually looked at, for example, one of our customers is a coffee machine manufacturer. And that’s a fairly simple, straightforward thing versus pharmaceuticals, where you start to get into some pretty arcane language around drugs and therapies, and that’s a lot more difficult. So I don’t know if we have all the answers for you. We’re looking at– pharmaceuticals, I think, will be a little bit of a tougher industry for us.

Dana Stanley: Interesting. And is it just English at this point?

Michael Tupanjanin: English, yes. We’ve done, now, tests in both Chinese and French. And interestingly enough, it’s taken about a day.

Dana Stanley: Wow.

Michael Tupanjanin: Yeah.

Dana Stanley: It took me longer than that to learn French.

Michael Tupanjanin: Well, what’s interesting about the technology, it’s not based on grammatical structure. It just needs to have a translation of all the words themselves, and then it can go out and train itself. So again, it’s a little bit different approach.

Dana Stanley: Interesting. So I have to ask, we’re here at the Net Promoter Conference, and by the time this interview is out, your release will have hit the wires. So tell me about this exciting initiative that you have going with Satmetrix.

Michael Tupanjanin: Well, from our perspective, it’s amazing on a couple of different levels. First, Satmetrix is clearly the leader in Net Promoter. They wrote the book on it. And they have established a very clear set of activities and workflows for people to actually improve their net Promoter Scores. So they are the methodological geniuses and also the workflow geniuses for helping companies improve their Net Promoter Score. And they’ve tied that directly to revenues, which is also a really, really good thing.

I think, from our perspective, being able to provide people a Net Promoter Score like a stock ticker, real-time, is huge. The old model has been you get your survey results back. You work on them and see how you improve over the next quarter. Now, you have an opportunity to actually see how you’re improving every 10 minutes if you need to, which is a huge breakthrough. And this is not an easy thing to do or replicate. From our perspective as a text analytics company, the fact that we have such high accuracy rates and the fact that our machine is flexible enough to actually take somebody else’s methodology and apply that to the social web is huge. There are very few people who can actually do that.

So from our perspective, it’s great. It also makes the information a lot more actionable. One of the things that I think the industry suffers from is that people sit there and say, yes, this sentence is positive. This is negative. Baggage handling was poor in this airport. What are we going to do? Who’s going to get that information, and what are they going to do with it? Being able to tie that to some kind of a standardized score for a company, I think, is a really big deal.

Romi Mahajan: So Dana, in about 45 minutes from this interview, but of course before this interview is published, there’ll be a piece of press on the wire around what we christened the SparkScore, which is a social NPS gauge. And it’s taking the notion of NPS, which is an industry-proven powerful methodology for loyalty and profit driving and completing the picture. The panorama is now complete. It used to be about structured, episodic, survey-based loyalty. And now it’s about the constant here-and-now social web loyalty. So we believe it’s a huge breakthrough for the industry, and Metavana’s very happy to power the SparkScore with, of course, Satmetrix, being the methodology and software provider.

Dana Stanley: So if I’m a customer who’s accustomed to using a Net Promoter Score, what will change for me?

Romi Mahajan: So I think your world gets better, slightly more complex but better, because we’re not saying don’t do normal Net Promoter. There’s a certain value in getting episodic structured data, longitudinally and otherwise. There’s also a certain value in understanding what’s being said anyway, unprompted, every day, 24/7, 365 worldwide. And so when you munge the two, you actually look at your business 360 degrees, as opposed to just seeing one fraction of not only the expression but also the ways in which customers express how they feel.

Dana Stanley:  That’s great, very exciting. So for the traditional, for lack of a better word, market research community, what should they take from this announcement?

Romi Mahajan: Let me break it into two categories. One’s smaller, and one’s bigger. So if the market research people who are familiar with, espouse, or follow NPS, clearly this is going to be a breakthrough, because it’s taking a very proven, powerful methodology and making it 21st century. It’s NPS 2.0. So for the NPS followers, it’s huge.

For the non-NPS followers, we’re all familiar enough with market research to know that it’s grappling with the abundance of data and the abundance of content and the burgeoning importance of the social web. And this allows them to start getting data and data feeds from the social web to use in anything, predictive analytics, reports, analysis of any sort. And so we believe that market research is an incredibly important part of the organization and of the industry.

But we also believe that it’s extremely limited by the technology. And now, we’re opening new business for them. So it’s about reinventing the industry and reinventing ourselves as market researchers.

Dana Stanley: Great. And if people want to learn more about the SparkScore, what should they do?

There’s a couple different things they can do if they’d like to learn more about the SparkScore. One is they can go to metavana.com. Then for second, go to satmetrix.com. Those are the best places to learn about the SparkScore. We will very shortly we will very shortly have a website called spark-score.com, very shortly, so not yet, in which people can play around with this and enter stuff in, and see what their score is.

Michael Tupanjanin: It’s interesting because it almost becomes, in a way, like the Klout score for companies, right? So we’re actually going to be posting a website that actually lists out, front and center, what people’s SparkScore is.

Dana Stanley: Interesting.

Michael Tupanjanin:  So anybody has access to it, whether it’s the companies themselves, customers, they’ll be able to go in, look at their Spark Score. We’re starting by rolling out five industries right now. But we think it’ll actually be very much like a corporate Klout score.

Romi Mahajan: Dana, under your tutelage, one day we hope that Research Access has an sNPS ticker running across it, so every company can come up and say, how are we doing?

Dana Stanley: So almost like a stock ticker concept?

Michael Tupanjanin: It is absolutely a stock ticker concept.

Dana Stanley: Very cool. Well, Michael, Romi, thank you for your time today.

Michael Tupanjanin: Appreciate it.

Romi Mahajan: Dana, our pleasure.