This document discusses using the Watson Natural Language Classifier to tag ideas from an M-CAFE dataset with topics. The 106 ideas were randomly split into a training set of 86 tagged ideas and a test set of 20 untagged ideas. Watson was trained on the training set and achieved 80% accuracy on the test set. Examples of correctly and incorrectly classified ideas are provided, along with questions about how the classifier is trained and whether unsupervised classification is available.
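The split-train-score workflow described above can be sketched locally. Since Watson NLC is a hosted service, the sketch below substitutes a simple scikit-learn pipeline (an assumption for illustration, not the Watson API), and uses toy idea/topic strings rather than the actual M-CAFE data:

```python
# Sketch of the workflow: split tagged ideas into train/test sets,
# train a text classifier, and score accuracy on the held-out set.
# A scikit-learn Naive Bayes pipeline stands in for Watson NLC here;
# the strings below are toy data, not the M-CAFE dataset.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

ideas = [
    "slow down the lecture pace", "more lecture examples please",
    "post homework solutions", "homework two is too long",
    "release the labs earlier", "lab instructions are unclear",
    "project milestones are uneven", "more time for the project",
] * 4  # repeated so a stratified train/test split is possible
topics = ["Lectures", "Lectures", "Homework", "Homework",
          "Labs", "Labs", "Projects", "Projects"] * 4

# Analogous to the 86/20 split in the document (here 24/8).
train_x, test_x, train_y, test_y = train_test_split(
    ideas, topics, test_size=0.25, random_state=0, stratify=topics)

clf = make_pipeline(TfidfVectorizer(), MultinomialNB())
clf.fit(train_x, train_y)
accuracy = accuracy_score(test_y, clf.predict(test_x))
```

With the real dataset, `accuracy` would correspond to the 80% figure reported above (16 of 20 test ideas tagged correctly).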
8. Idea | Topic

Slower pace. | Lectures
Add Lecture overview | Resources
I want more practice with Relational Algebra and eventually SQL. | Homework
The last few lectures have been very mathematically precise in notation which can make it a bit tricky to wrap your head around. Specific questions/examples (like what might be on hw) would be great to help us make sure we understand it moving forward. | Lectures
The project seems a little stop and go. We haven't been able to work on it for a week or so but I feel like we'll soon be expected to do a bunch of work for DP2. It would be helpful if we could have the tools to have a more constant level of work on the project. | Projects
Please try and post the labs earlier so that we can get a head start reading and understanding them. | Labs
Homework 2 only has database questions, maybe put some connectives? | Homework
Incorporate a short question and answer period midway of lecture to assess participating students' understanding of the lecture/topics being presented. | Lectures
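Tagged idea/topic pairs like those above match the shape Watson NLC expects for training data: a headerless CSV with the example text in the first column and the class label in the next. A minimal sketch of writing such a file (the file name and the subset of rows are illustrative):

```python
# Write tagged ideas to a headerless two-column CSV
# (text, class), the training-data format Watson NLC accepts.
import csv

tagged_ideas = [
    ("Slower pace.", "Lectures"),
    ("Add Lecture overview", "Resources"),
    ("I want more practice with Relational Algebra and eventually SQL.",
     "Homework"),
    ("Homework 2 only has database questions, maybe put some connectives?",
     "Homework"),
]

with open("training_data.csv", "w", newline="") as f:
    csv.writer(f).writerows(tagged_ideas)
```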
Examples of ideas which are misclassified:
10. Idea | True Tag | Pred Tag | Confidence

3. I would like have some implantation problems using SQL | Homework | New Topics | New Topics: 0.803; Homework: 0.076
4. More hands on experiences on Databases | Homework | New Topics | New Topics: 0.786; Homework: 0.117
Misclassifications Contd…
• The true tag is among the top two tags suggested by the
classifier.
• Misclassification occurs when an idea is tagged arbitrarily or lacks sufficient context.
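The first observation above, that the true tag still appears among the classifier's top two suggestions, can be checked directly from per-class confidence scores like those on the previous slide. A sketch using the two misclassified examples shown there (only the two classes listed on the slide are included; Watson actually returns a confidence for every class):

```python
# Check whether the true tag falls within the top-k predicted tags,
# given per-class confidence scores for each misclassified idea.
predictions = [
    {"true_tag": "Homework",
     "confidences": {"New Topics": 0.803, "Homework": 0.076}},
    {"true_tag": "Homework",
     "confidences": {"New Topics": 0.786, "Homework": 0.117}},
]

def true_tag_in_top_k(example, k=2):
    # Rank classes by confidence, highest first.
    ranked = sorted(example["confidences"],
                    key=example["confidences"].get, reverse=True)
    return example["true_tag"] in ranked[:k]

top1_hits = [true_tag_in_top_k(p, k=1) for p in predictions]  # misses
top2_hits = [true_tag_in_top_k(p, k=2) for p in predictions]  # hits
```

Both examples miss at k=1 (they are misclassifications) but hit at k=2, matching the observation above.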