@ThingsExpo Authors: Zakia Bouachraoui, Elizabeth White, Yeshim Deniz, Pat Romanski, Liz McMillan

Related Topics: @DXWorldExpo, @ThingsExpo

@DXWorldExpo: Blog Feed Post

Favorite 2015 @Schmarzo Big Data Blogs | @BigDataExpo #IoT #BigData

It’s that time of year that I review everything that I’ve written over the past year and share my favorite blogs

It’s that time of year that I review everything that I’ve written over the past year and share my favorite blogs. As many of you know, I travel frequently and because I’ve continuously seen every airline movie, I have plenty of time to write. And according to the commentary, every now and then I have a good one. So here are my Top 10 Blogs from 2015!

#10EMC World Day 1: Big Data Business Model Maturity Index and Analytic Profiles. I believe that this is the first blog where I introduce some very important data science concepts: Analytic Profiles and Scores. I talk a lot about the power of Scores in delivering metrics that are potentially better predictors of performance. Scores are important in supporting the decisions you are trying to make and the actions or outcomes you are trying to predict. And Analytic Profiles are how we bring all the analytics insights to support the organization’s key business initiatives.

For example, let’s look at what Bill Schmarzo’s Analytic Profile might be from the perspective of Starbucks.

  • Demographic Information.  This is the basic information about me such as name, home address, work address, age, gender, marital status, income level, value of home, length of time in current home, education level, number of dependents, etc.
  • Behavioral Information:  Now it gets interesting, as we want to uncover behavioral insights that are relevant for the business initiatives that Starbucks is trying to support.  Depending upon the targeted business initiative (e.g., customer acquisition, customer retention, advocacy development, new product introductions), here is some behavioral information that Starbucks might want to capture about me:  favorite drinks in rank order, favorite stores in rank order, most frequent time-of-day to visit a store, most frequent day-of-week to visit a store, recency of store visit, frequency of store visits in past week / month / quarter, monetary value of store visits the past week / month / quarter, how long do I stay at which stores, etc.
  • Classifications.  Now we want to try to create some “classifications” about Bill Schmarzo life that might have impact on my key business initiatives such as:  Lifestage classifications, Lifestyle classifications, Product preference classifications, Store visit classifications, etc.
  • Rules.  We might also want to capture some rules or propensities about Bill’s usage patterns that we can use to support Starbucks’ key business initiatives, including: propensity to buy oatmeal when he buys coffee when traveling in the morning, propensity to buy a cookie/pastry when traveling in the afternoon, propensity to buy product in the channel, propensity to order online, etc.
  • Scores.  We also may want to create scores to support decision-making and process optimization.  Scores that we might want to create could include Customer Lifetime Value Score, Advocacy Score, Loyalty Score, Product Usage Score, Store Visitation Score, etc.

#9Weaving Data Hay Into Business Gold. Some organizations think that they will just jump data into the data lake and the deploy data scientists to “find needles in the data lake haystack.” But the “find a needle in the haystack” is the wrong analogy for the data lake.  “Finding a needle in the haystack” is a data warehouse / Business Intelligence way of thinking about analysis; to slice-and-dice the data haystack trying to find needles.

However, data science with a data lake is more like trying to “weave data hay into business gold.”  So instead of thinking about the data lake as this haystack from which you are trying to find needles, think instead about the data lake as the loom for your data where you weave data hay into business gold.

The primary goal of the data lake, from a business perspective, is to think differently; to enable your data science team to not search for random needles in haystacks, but instead think about how they can leverage the data lake to “weave the data hay into business gold.”

#8An Executive Mandate: Think Open Business.   This blog discusses the power of thinking about an “open business” model, where we define “open” as creating a platform or ecosystem that allows third-parties (developers, partners, resellers) to provide value (and make money) upon that platform or ecosystem. Attacking the market with a closed business model introduces two significant liabilities:

  1. You limit innovation to only the innovation that your company itself can deliver.
  2. You force your customers to only have the choice of only the products that the original manufacturer can develop and deliver.

This blog discusses how Apple and Google addressed the innovation challenge with an open business model that encourages app developers to develop new, innovative products on top of their platform; the model allows third-party app developers to make money on top of the Apple and Google smartphone platforms.  This adds considerable value to their respective platforms and in the process, Apple and Google are transforming their business models from a product-centric business model to a market-enabling business model; one of the key transformations as an organization seeks to “metamorphosize” their business models.

The blog concludes with an exercise on how my favorite kitchen appliance – the Vitamix could create a more open and creative marketplace for its products.

#7The Mid-market Big Data Call to Action. Small organizations seem to have this inferiority complex when it comes to big data. It would seem that the deck is stacked against the small organizations that lack the technology resources to invest or the data experience upon which to leverage to compete with the large companies in the area of big data. However, I think the opposite is true, that small organizations have a HUGE advantage over many of their larger counterparts with respect to integrating data and analytics into their business models including:

  • Smaller organizations have fewer data silos, so they have a much clearer view of their customers, products, operations and markets.
  • Smaller organizations have a smaller number of HIPPO’s (the Highest Paid Person’s Opinion) with which to deal.
  • Smaller organizations can unlearn faster.
  • Smaller organizations are less fixated on technology.
  • But the most important reason is that it is easier for small organizations to institute the organizational and cultural change necessary to actually act on the analytic insights.

#6Why Do I Need A Data Lake? The data lake is a powerful big data architecture that leverages the economics of big data to enable storage, management and analysis of data as compared to traditional data warehouse technologies. The key to maximizing the value of your big data initiatives is the analytics hub and spoke service architecture.

The hub of the architecture is the data lake:

  • Centralized, singular, schema-less data store with raw data
  • Mechanism for rapid ingestion of data with appropriate latency
  • Ability to map data across sources and provide visibility and security to users
  • Catalog to find and retrieve data
  • Costing model of centralized service
  • Ability to manage security, permissions and data masking
  • Supports self-provisioning

The spokes of the architecture are the analytic use cases:

  • Ability to perform analytics (data scientist)
  • Analytics sandbox (HDFS, Hadoop, Spark, Hive, HBase)
  • Data engineering tools (Elastic Search, MapReduce, YARN, HAWQ, SQL)
  • Analytical tools (SAS, R, Mahout, MADlib, H2O)
  • Visualization tools (Tableau, DataRPM, ggplot2)
  • Ability to exploit analytics (application development)
  • 3rd platform application (mobile app development, web site app development)
  • Analytics exposed as services to applications (API’s)
  • Integrate in-memory and/or in-database scoring and recommendations into business process and operational systems

#5In Big Data, Are You Using Refrigerators or Stoves? This blog really challenges how organizations are positioning and selling big data. Too many “experts” are over-emphasizing the big data technology aspects and ignoring the really hard work – understanding what business opportunities exist and how the organization is trying to address them with data and analytics.

My University of San Francisco MBA class finished their Big Data MBA course. We used our trusty “thinking like a data scientist” process to teach our students how to identify a business opportunity, and then use the “thinking like a data scientist” process to drive cross-organizational collaboration to come up with ideas that they can turn into actions using data and analytics.

My co-teacher, the ever talented and energetic Professor Mouwafac Sidaoui, and I asked our students: “What employer wouldn’t want an employee who can excel at doing that?”

#4Big Data Fails: How to Avoid Them. This isn’t exactly a blog. This is an interview that I had with Jessica Davis (InformationWeek) that nicely summarizes many of the keys to big data success. The article actually makes me sound smart (and that’s no small task!). To quote the article:

The companies that run into the most trouble [with big data] are those in which data is in silos, and the thinking about that data is also in silos. For instance, in a banking company there may be a checking account silo and a mortgage silo, and the owners of each group aren’t accustomed to thinking about the whole customer who consumes both services.

Companies that can get past that limitation in their thinking are more likely to be successful with their big data initiatives.

And that example also shows an important factor in successful big data initiatives – collaboration among groups who may not normally collaborate with each other. It relies on team members with different areas of expertise working well together.

“The places where we are seeing success is where the business people and the IT people like each other.”

#3Creativity Is a Team Activity in Big Data. This could have very easily been my favorite blog. It certainly turned out to be one of my most popular blogs.

The potential of big data is only limited by the creative thinking of your business stakeholders. Maybe the biggest inhibitor to creative thinking is the baggage about data and analytics that we have picked up over the years. Organizations need to embrace the power of “thinking differently,” especially with respect to:

  • Data as a strategic asset to be gathered, enriched and shared, versus data as a cost to be minimized
  • The potential of predictive (what is likely to happen) and prescriptive (what should I do) questions versus of just mechanically capturing descriptive (what happened) questions
  • The power of data science to quantify those variables and metrics that are better predictors of performance, versus business intelligence that just reports on what happened while monitoring current business performance
  • Building analytic profiles at the individual (human, machine) level to uncover individual behaviors, tendencies, propensities, interests, passions, associations and affiliations that can lead to specific actionable insights, versus relying on aggregated data to uncover general market trends

#2 – Thinking Like A Data Scientist series. This is probably unfair because this was a four-blog series, but this is my favorite blog(s) from 2015. The series included:

The 8-step “Thinking Like A Data Scientist” process is an enabler for organizations that want to get the most of both their data…and their people. It drives organizational alignment around an organization’s key business initiatives and uncovers where and how big data and data science can optimize key business processes, uncover new monetization opportunities and deliver a more compelling customer experience.

#1Big Data MBA Textbook: Driving Business Strategies with Data Science. Clearly #1 for me was the release of my second Big Data book. This book was written as a textbook to use as part of the class I teach at the University of San Francisco School of Management, but I hope that others can use this textbook to advance big data and data science as business disciplines for tomorrow’s business leaders.

I hope that 2016 is as productive, and given the number of Big Data Vision Workshops that I have to facilitate, I bet it will be!

Favorite 2015 Schmarzo Big Data Blogs
Bill Schmarzo

More Stories By William Schmarzo

Bill Schmarzo, author of “Big Data: Understanding How Data Powers Big Business” and “Big Data MBA: Driving Business Strategies with Data Science”, is responsible for setting strategy and defining the Big Data service offerings for Hitachi Vantara as CTO, IoT and Analytics.

Previously, as a CTO within Dell EMC’s 2,000+ person consulting organization, he works with organizations to identify where and how to start their big data journeys. He’s written white papers, is an avid blogger and is a frequent speaker on the use of Big Data and data science to power an organization’s key business initiatives. He is a University of San Francisco School of Management (SOM) Executive Fellow where he teaches the “Big Data MBA” course. Bill also just completed a research paper on “Determining The Economic Value of Data”. Onalytica recently ranked Bill as #4 Big Data Influencer worldwide.

Bill has over three decades of experience in data warehousing, BI and analytics. Bill authored the Vision Workshop methodology that links an organization’s strategic business initiatives with their supporting data and analytic requirements. Bill serves on the City of San Jose’s Technology Innovation Board, and on the faculties of The Data Warehouse Institute and Strata.

Previously, Bill was vice president of Analytics at Yahoo where he was responsible for the development of Yahoo’s Advertiser and Website analytics products, including the delivery of “actionable insights” through a holistic user experience. Before that, Bill oversaw the Analytic Applications business unit at Business Objects, including the development, marketing and sales of their industry-defining analytic applications.

Bill holds a Masters Business Administration from University of Iowa and a Bachelor of Science degree in Mathematics, Computer Science and Business Administration from Coe College.

IoT & Smart Cities Stories
Early Bird Registration Discount Expires on August 31, 2018 Conference Registration Link ▸ HERE. Pick from all 200 sessions in all 10 tracks, plus 22 Keynotes & General Sessions! Lunch is served two days. EXPIRES AUGUST 31, 2018. Ticket prices: ($1,295-Aug 31) ($1,495-Oct 31) ($1,995-Nov 12) ($2,500-Walk-in)
Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life settlement products to hedge funds and investment banks. After, he co-founded a revenue cycle management company where he learned about Bitcoin and eventually Ethereal. Andrew's role at ConsenSys Enterprise is a mul...
Business professionals no longer wonder if they'll migrate to the cloud; it's now a matter of when. The cloud environment has proved to be a major force in transitioning to an agile business model that enables quick decisions and fast implementation that solidify customer relationships. And when the cloud is combined with the power of cognitive computing, it drives innovation and transformation that achieves astounding competitive advantage.
Nicolas Fierro is CEO of MIMIR Blockchain Solutions. He is a programmer, technologist, and operations dev who has worked with Ethereum and blockchain since 2014. His knowledge in blockchain dates to when he performed dev ops services to the Ethereum Foundation as one the privileged few developers to work with the original core team in Switzerland.
Machine learning has taken residence at our cities' cores and now we can finally have "smart cities." Cities are a collection of buildings made to provide the structure and safety necessary for people to function, create and survive. Buildings are a pool of ever-changing performance data from large automated systems such as heating and cooling to the people that live and work within them. Through machine learning, buildings can optimize performance, reduce costs, and improve occupant comfort by ...
René Bostic is the Technical VP of the IBM Cloud Unit in North America. Enjoying her career with IBM during the modern millennial technological era, she is an expert in cloud computing, DevOps and emerging cloud technologies such as Blockchain. Her strengths and core competencies include a proven record of accomplishments in consensus building at all levels to assess, plan, and implement enterprise and cloud computing solutions. René is a member of the Society of Women Engineers (SWE) and a m...
IoT is rapidly becoming mainstream as more and more investments are made into the platforms and technology. As this movement continues to expand and gain momentum it creates a massive wall of noise that can be difficult to sift through. Unfortunately, this inevitably makes IoT less approachable for people to get started with and can hamper efforts to integrate this key technology into your own portfolio. There are so many connected products already in place today with many hundreds more on the h...
Digital Transformation is much more than a buzzword. The radical shift to digital mechanisms for almost every process is evident across all industries and verticals. This is often especially true in financial services, where the legacy environment is many times unable to keep up with the rapidly shifting demands of the consumer. The constant pressure to provide complete, omnichannel delivery of customer-facing solutions to meet both regulatory and customer demands is putting enormous pressure on...
Charles Araujo is an industry analyst, internationally recognized authority on the Digital Enterprise and author of The Quantum Age of IT: Why Everything You Know About IT is About to Change. As Principal Analyst with Intellyx, he writes, speaks and advises organizations on how to navigate through this time of disruption. He is also the founder of The Institute for Digital Transformation and a sought after keynote speaker. He has been a regular contributor to both InformationWeek and CIO Insight...
Digital Transformation: Preparing Cloud & IoT Security for the Age of Artificial Intelligence. As automation and artificial intelligence (AI) power solution development and delivery, many businesses need to build backend cloud capabilities. Well-poised organizations, marketing smart devices with AI and BlockChain capabilities prepare to refine compliance and regulatory capabilities in 2018. Volumes of health, financial, technical and privacy data, along with tightening compliance requirements by...