Beruflich Dokumente
Kultur Dokumente
Contributor
Ryan Dolen
Senior Data Scientist
questions@luxresearchinc.com
Innovate Smarter
& Grow Faster
ABSTRACT
AI and machine learning are hyped for use in applications as
varied as marketing and predictive maintenance of machines. The
innovation community has long desired tools to enable greater
efficiency and impact and is now considering whether AI can deliver.
This white paper analyzes tools for parsing innovation data and
pulling out insights in an efficient and unbiased way in the ideation
and vetting stages of innovation. It discusses needs, approaches,
uses cases, and best practices to help organizations leverage the
tools for maximum impact.
TABLE OF CONTENTS
Executive Summary......................................................................................2
Natural language processing and topic modeling are the ML tools of choice.....4
1
EXECUTIVE SUMMARY
Artificial intelligence (AI) and machine learning (ML) have become ubiquitous concepts across many industries. The algorithms
and libraries to do it yourself are increasingly commoditized, so deploying and training a model can be done in the hands of a
skilled data scientist in a short amount of time. Corporate innovators, entrepreneurs, venture capitalists, and government funding
agencies are among those seeking to leverage AI/ML to identify the next impactful and profitable technology. However, this
application lacks training data and quick feedback loops to verify whether today’s decisions will bear fruit.
Innovators must consider the why, what, and how of AI for their process
To bring clarity, this white paper explains why innovators need better solutions, what use cases to attack, and how to deploy AI.
• Finding the right technologies and companies to invest in is rife with challenges. These challenges include turning amorphous
ideas into action, moving through the pipeline faster, overcoming bias, and understanding the competitive landscape.
• Use cases for AI and ML in innovation are wide-ranging and include landscaping, concept similarity, competitive analysis,
and trend identification.
• Because innovation data is often largely text, natural language processing (NLP) technologies, and specifically topic modeling,
are the ideal tool to extract insight from a sea of data.
To illustrate the power of this approach, we use topic modeling to landscape the space of 2D materials and show how significant
insights fall out of a single model.
Best practices can help you get started while avoiding missteps
Organizations wishing to implement AI and ML for innovation must consider some key issues when starting any digital
transformation initiative. These issues include whether to buy or build, how to integrate humans into the loop, and how to
implement cultural change. Because data science is not a skill that’s readily available to build for every corporation, partnership is
wise, as is a plan to enable cultural change so that the technology augments the human – rather than replacing it.
2 questions@luxresearchinc.com
MACHINE LEARNING IS EASY;
EXTRACTING VALUE IS HARD
Artificial intelligence and machine learning have become ubiquitous concepts across many industries. The algorithms and libraries
to do it yourself are increasingly commoditized, so deploying and training a model can be done in the hands of a skilled data
scientist in a short amount of time. Increasingly, there are vendors that layer on beautiful user interfaces, scalable platforms, and
specific use cases to solve problems as varied as scoring marketing leads and predicting when a motor will fail. In all of these
cases, human understanding of the challenge to be solved is paramount.
Count corporate innovators, entrepreneurs, venture capitalists, and government funding agencies among those seeking to leverage
AI/ML to identify the next impactful and profitable technology. Indeed, many a client has come to us saying, “We’d like to use AI to
improve our innovation process… but where and how?” Unlike the examples mentioned above, this application lacks training data
and quick feedback loops to verify whether today’s decisions will bear fruit. Specifically, this group is challenged to:
• Overcome bias.
If the process wholly relies on people to taxonomize and investigate a given space, the result may be biased due to the limited
knowledge and experience of the person conducting the investigation. As a result, the analysis could miss a technology or idea
that could be fruitful simply because they didn’t think of it at the beginning of the search.
3 questions@luxresearchinc.com
Our data science team built the Lux Intelligence Engine to address these challenges for both our clients and our own analysts.
After developing and experimenting with various tools, we found two main benefits of ML for the front end of innovation:
• Speed.
Applying ML to data sets like patents can generate fast landscapes that identify key areas of innovation as well as important
players within those areas. Whereas a traditional approach would require a thorough taxonomy and numerous search strings
to start with, an ML approach requires only a general definition of the space. ML can also help prioritize what documents are
worth spending time on. Furthermore, it buckets key concepts and data points automatically, which would require significant
time to do manually. For example, we were able to identify the innovation focus areas in the cannabis space, visualize trends,
and identify white space using machine learning, reducing the analysis effort from weeks to days.
• Landscaping.
Given an area of interest, ML can automatically build a taxonomy that defines key areas of innovation underneath a larger
concept. The starting concept can be arbitrary in size (e.g., as broad as “sustainability” or as specific as “superabsorbent
polymers”). The ML algorithm can extract key topics from the resulting data set, and cluster and rank organizations and their
associated documents by the strength of the link between the text of the document and the topic. An expert is still needed to
interpret and label each topic, but with these results a user can identify leading players for each topic and easily extract
trends on which topics are growing or shrinking.
4 questions@luxresearchinc.com
• Concept similarity.
In conducting a freedom-to-operate analysis, or simply in trying to find relevant academic papers close to a known concept,
finding and prioritizing relevant documents is extremely valuable. Using NLP technology, a model can establish a fingerprint for
any given document and use that to identify other documents with similar fingerprints, highlighting which ones are most
relevant to the subject of interest. Automating even a portion of the arduous document review process can significantly
improve efficiency and time to actionable insight.
As displaying the power of ML for all the possible use cases is beyond the scope of a single document, we will provide an example
of landscaping here, as it is a universal exercise across all of the organizations we work with.
Methodology
We began by defining a master patent search that was as general as possible while staying focused on 2D materials. Then, we
trained a topic model using all those patents (approximately 7,500 globally). The model identified 14 topics; not all were relevant,
so only a portion of those are presented here. By looking at the associated keywords and patents for each topic, we labeled the
topics based on our familiarity with the technologies and applications. From there, we extracted trends and leaders for the different
topics.
Example Results
In Figure 3, we show a market map, but one that differs from other maps you may have seen:
• The groupings are defined by the topic model plus human-assigned labels.
Once fed the master search, the topic model identified key topics that are commonly occurring. Those topics are defined by
keywords, which a subject matter expert uses to label the topics.
5 questions@luxresearchinc.com
• The logos are ordered by strength in the topic.
While many maps have a random assortment of logos, we selected the top four patentors associated with each topic and
show them in descending order from left to right. The model identified many more key players, but only the top four are
shown here as an example of the quick visual insight that can fall out of the topic model.
• Leading innovators are not just startups, and some players come up more than once.
Since the goal is to provide insight on the 2D materials space, the map below is comprehensive in covering startups, large
companies, and academic institutions. Furthermore, as you might expect, certain players lead in more than one space and
thus show up more than once on the map (see, e.g., Samsung Electronics).
6 questions@luxresearchinc.com
Additionally, we can compare topics against each other to evaluate the patenting trajectory. Figure 4 shows patent
trends for six topics and conveys where activity is focused and how it has changed in recent years.
This analysis is a brief display of some of the rich insight that falls out of a single model. Additionally, there’s the
benefits of speed (from weeks to days) and the unbiased nature of results. We have found that further mining of the
data can lead to a thorough segmentation of innovation activity in a given space and a solid view of the competitive
landscape. It’s possible to analyze more topics, break topics down into subtopics, show trends across all topics, and
more. Again, landscaping is just one use case for topic modeling, which holds great potential to change how we as a
community execute on innovation.
Some organizations already have established teams using data science in their traditional R&D and product development;
one strategy might be to reassign them to work on improving innovation processes across the organization. The other option
is to work with a partner with expertise on sourcing, scrubbing, organizing, and analyzing data. For many, this will be the
most efficient route; however, beware of pure data shops that don’t understand the use case or goals of the deployment. The
likely solution will include both building and buying, to combine internal domain expertise with ML capabilities and scalable
platforms.
We have introduced a tangible way for machine learning to impact current innovation processes by creating efficiencies and
identifying opportunities that may have been buried in vast and fast-changing data sets. Ultimately, we are only at the beginning
of learning how ML can improve the way we innovate, and we expect proliferation of use cases like those above, as well as the
emergence of new ones that will create an improved ROI on our innovation efforts and investments.
7 questions@luxresearchinc.com
Lux Research is a leading provider of tech-enabled research
and advisory services, helping clients drive growth through
technology innovation. A pioneer in the research industry, Lux
uniquely combines technical expertise and business insights with
a proprietary intelligence platform, using advanced analytics and
data science to surface true leading indicators. With quality data
derived from primary research, fact-based analysis, and opinions
that challenge traditional thinking, Lux empowers clients to make
more informed decisions today to ensure future success.
luxresearchinc.com
questions@luxresearchinc.com
@LuxResearch Webinars
Downloaded by Nitin Sareen, Aditya Birla Science and Technology Co. Pvt. Ltd on January 09, 2020
Use of this document is limited to paid members for their individual business purpose and may not be further disclosed, disseminated, distributed, reproduced, or
retransmitted to any person, firm, or entity without Lux Research, Inc.’s written permission. You may request permission by emailing approvals@luxresearchinc.com
© 2019 Lux Research Inc. Copyright strictly enforced.