Speech Technology and Big Data

•Transferir como PPTX, PDF•

1 gostou•1,988 visualizações

Nick Campbell is a speech scientist who has held research positions at AT&T Bell Labs, IBM UK Scientific Centre, and ATR basic telecom research. He has also served on boards and as a professor. Campbell discusses the growth of speech and multimedia data collection over time, challenges around data management and privacy, and the need for standardized tools and resources to support research using large corpora.

Tecnologia Saúde e medicina

*
Nick Campbell
Speech Communication Lab
Trinity College Dublin, Ireland

*
* TCD – Stokes Professor (Dublin)
* CNGL – PI – Delivery & Interaction
* ELRA – board member / VP – speech
* ISCA – board member – workshops
* IEEE – Sig Proc Soc - SLTC member
* ATR/NiCT – research director(Japan)
* Speech Prosody 2014 (Dublin) host

* Speech scientist/researcher/corpus analyst

* AT&T Bell Labs
* The ideas people – think ‘BIG’

* IBM UK Scientific Centre
* The corpus people – ‘collect it all’

* ATR basic telecom research
* The fundamentals - learn how to ‘infer’ from it

*

* we used to be considered BIG – speech data
(and now multimedia) gobbled up memory
* I collected 1500 hours of everyday chat/daily
conversations in 2000 – (@1GB per minute) -
took 5-years to process!

* now Apple, Google, Ms, .. get that each minute
(but the secret is in the metadata)

* we need accessible data & tools for everybody!

*

* but we need to manage privacy issues first!

*

* and we need a way to protect IP as well

* written publications have ISBN standard
* work is now underway (cf ELRA & COCOSDA) to
institute ISLRN for Language Resources
* researchers need to get credit for corpora as
well as for publishing research results
* The community needs a way to identify,
acknowledge, attribute, and reference data

*

* tools for processing speech & multimodal data

* htk, hts, R, etc . . . not simple to use

* little consensus on what features to encode

* manual bootstrap – much too time-consuming!

*

* social interaction

* personal idiosyncracies

* group dynamics – multimodal data (TB/hr)

* issues of robustness / domain specificity /
privacy / storage & archiving / redistribution

*

context analytics:

* cultural and language-specific needs
* multimodal – multimedia – multilingual
* tools for ‘less-well-supported’ languages

* e.g., U-STAR consortium for speech research –
sharing tools & data & knowledge for research

*

* European Language Resources Association
* COCOSDA – int’l coordinating committee
* IEEE SLTC, ISCA SIGS, there are places to go

* but are they ready for really BIG data?
perhaps not yet . . .

*

* curricula prepare people

* what standards to rely on?
* what resources available?
* what features to extract?
* what tools to work with?
* what use to put it to?
* what info to hide?
* what to do next?

*

Mais conteúdo relacionado

Destaque

Relational Database to RDF (RDB2RDF)

EUCLID project

Comment manager des geeks - Devoxx 2015

Publicis Sapient Engineering

Annotation Processor, trésor caché de la JVM

Raphaël Brugier

Querying Linked Data on Android

EUCLID project

This presentation addresses the main issues of Linked Data and scalability. In particular, it provides gives details on approaches and technologies for clustering, distributing, sharing, and caching data. Furthermore, it addresses the means for publishing data trough could deployment and the relationship between Big Data and Linked Data, exploring how some of the solutions can be transferred in the context of Linked Data.

Scaling up Linked Data

EUCLID project

This presentation focuses on providing means for exploring Linked Data. In particular, it gives an overview of current visualization tools and techniques, looking at semantic browsers and applications for presenting the data to the end used. We also describe existing search options, including faceted search, concept-based search and hybrid search, based on a mix of using semantic information and text processing. Finally, we conclude with approaches for Linked Data analysis, describing how available data can be synthesized and processed in order to draw conclusions.

Interaction with Linked Data

EUCLID project

This presentation looks in detail at SPARQL (SPARQL Protocol and RDF Query Language) and introduces approaches for querying and updating semantic data. It covers the SPARQL algebra, the SPARQL protocol, and provides examples for reasoning over Linked Data. We use examples from the music domain, which can be directly tried out and ran over the MusicBrainz dataset. This includes gaining some familiarity with the RDFS and OWL languages, which allow developers to formulate generic and conceptual knowledge that can be exploited by automatic reasoning services in order to enhance the power of querying.

Querying Linked Data

EUCLID project

This presentation gives details on technologies and approaches towards exploiting Linked Data by building LD applications. In particular, it gives an overview of popular existing applications and introduces the main technologies that support implementation and development. Furthermore, it illustrates how data exposed through common Web APIs can be integrated with Linked Data in order to create mashups.

Building Linked Data Applications

EUCLID project

This presentation introduces the main principles of Linked Data, the underlying technologies and background standards. It provides basic knowledge for how data can be published over the Web, how it can be queried, and what are the possible use cases and benefits. As an example, we use the development of a music portal (based on the MusicBrainz dataset), which facilitates access to a wide range of information and multimedia resources relating to music.

Usage of Linked Data: Introduction and Application Scenarios

EUCLID project

Découvrez les annotations Java comme vous ne les avez jamais vues ! Olivier Croisier, expert Java, anime une conférence de deux heures sur les Annotations, à destination des développeurs et des architectes. Elle couvre leur utilisation, développement, et manipulation au compile-time et au run-time grâce aux Annotation Processors et à la Réflexion. * Présentation : Historique, cas d'utilisations et limitations * Tour d'horizon des annotation disponibles * Utilisation des annotations * Développer une annotation personnalisée : structure, propriétés et méta-annotations * Outillage compile-time : les pluggable annotation processors * Outillage runtime : Réflexion * Injection d'annotations * Conclusion

Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUG

Zenika

A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...

SlideShare

Destaque (11)

Relational Database to RDF (RDB2RDF)

Comment manager des geeks - Devoxx 2015

Annotation Processor, trésor caché de la JVM

Querying Linked Data on Android

Scaling up Linked Data

Interaction with Linked Data

Querying Linked Data

Building Linked Data Applications

Usage of Linked Data: Introduction and Application Scenarios

Conférence sur les annotations Java par Olivier Croisier (Zenika) au Paris JUG

A Guide to SlideShare Analytics - Excerpts from Hubspot's Step by Step Guide ...

Semelhante a Speech Technology and Big Data

GBIF BIFA mentoring in Los Banos, Philippines for the South-East Asian ASEAN Biodiversity Heritage Parks. With Dr. Yu-Huang Wang, Dr. Po-Jen Chiang, and Guan-Shuo Mai from TaiBIF the GBIF node of Taiwan (Chinese Tapei); and the Biodiversity Informatics team at ASEAN Centre For Biodiversity. http://www.gbif.no/events/2016/gbif-bifa-mentoring.html Credits: EUDAT/OpenAire, December 2015 & May 2016, CC-BY-4.0 * http://www.slideshare.net/EUDAT/eudat-research-data-management * http://www.slideshare.net/EUDAT/research-data-management-introduction-eudatopen-aire-webinar?ref=https://eudat.eu/events/webinar/research-data-management-an-introductory-webinar-from-openaire-and-eudat * https://eudat.eu/events/webinar/research-data-management-an-introductory-webinar-from-openaire-and-eudat * http://www.instantpresenter.com/WebConference/RecordingDefault.aspx?c_psrid=EB57D6888147

GBIF BIFA mentoring, Day 5a Data management, July 2016

Dag Endresen

Born Digital Archives

LIFE-SHARE Project

Importance of Database in Library

Department of Library and Information Science, HPT Arts and RYK Science College, Nashik

IWST 2013: Intro

ESUG

Presentation given by Kathryn Cassidy, Software Engineer, Digital Repository of Ireland, on May 11th, 2016 in the Royal Irish Academy, Dublin, as part of the DRI Training Series 'Preparing Your Collection for DRI'. The seminar introduced attendees to the principles of metadata and metadata standards, with an emphasis on the standards used for ingest of collections into DRI. The seminar also introduced the subject of XML.

Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

dri_ireland

Cloud Programming Models: eScience, Big Data, etc.

Alexandru Iosup

DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...

dri_ireland

Keynote at iUser 2011 in Malaysia Abstract: The interface to personal computers has changed little in more than 30 years since the landmark Xerox Star. Beneath layers of metaphor for the user and window management system abstractions for the developer, there is a deep underlying model of disk + processing + screen & mouse/keyboard. While the mouse and physical keyboard have sometimes morphed into touch screen and soft keyboard, and network and cloud devices masquerade as local disks, the underlying model is unchanged. However, users have seen their personal information, which was previously fragmented one filing system, email, etc. further dispersed onto Flickr, cloud services and social networks. To some extent mobile platforms, iOS, Android, Windows ME, present a different model, but if anything often heavily fire-walled Apps further fragment the user experience. What might a personal information environment be like that took seriously the fact that the objects of interest to a user are photos and documents, not files stored on disks, jobs to be done not apps?

iUser2011 Keynote: The Personal Information Environment beyond the Personal C...

Alan Dix

dbGLOVE (presentation at Silicon Valley Personal Health Technology)

QIRIS

Takeda 101214short-d

Culture Mondo Network Asia-Pacific Secretariat

Six Use Cases for Edinburgh DataShare

Robin Rice

Using islandora to build digital collections - 2016.01.29 OLA 2016

KellliBee

Digital Archive of Knowledge for Sharing and Re-using

National Institute of Informatics (NII)

Challenges for Linked Data in Japan

National Institute of Informatics (NII)

Useful unstructured text occurs in plentiful amounts, and often is central to the success of a business. The benefits of being able to successfully decipher unstructured text can be direct or derived. Companies which offer products for medical differential diagnosis are directly benefitted by the ability to correctly extract drug-disease interactions from publications, for example. As for derived benefits of text processing, we need to look no further than cases of improving process flows by analyzing the sentiment of the emails a company receives from its customers. Being at the frontier of natural language processing, information representation and retrieval, information extraction has been the subject of extensive research for several decades and there are plenty of existing techniques to help with the understanding of unstructured textual content. This presentation will introduce and summarize useful techniques that are helpful in tackling sub-domains of information extraction, such as named entity recognition, keyword extraction and document summarization for efficient retrieval. Additionally, the talk will also emphasize low-resource cases, when not much useful labelled information is available.

Information Extraction from Text, presented @ Deloitte

Deep Kayal

Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...

Michael Hausenblas

What's the fuss about all this metadata?

Sara Sterkenburg

An information environment for neuroscientists

David Wallom

Ensuring Continuing Access to Online Scholarly Resources

EDINA, University of Edinburgh

Digital Cultural Heritage and the new EU Framework Programme

locloud

Semelhante a Speech Technology and Big Data (20)

GBIF BIFA mentoring, Day 5a Data management, July 2016

Born Digital Archives

Importance of Database in Library

IWST 2013: Intro

Kathryn Cassidy - DRI Training Series: 4. Metadata and XML

Cloud Programming Models: eScience, Big Data, etc.

DRI Introduction to Digital Preservation Training- Metadata and xml-Kathryn C...

iUser2011 Keynote: The Personal Information Environment beyond the Personal C...

dbGLOVE (presentation at Silicon Valley Personal Health Technology)

Takeda 101214short-d

Six Use Cases for Edinburgh DataShare

Using islandora to build digital collections - 2016.01.29 OLA 2016

Digital Archive of Knowledge for Sharing and Re-using

Challenges for Linked Data in Japan

Information Extraction from Text, presented @ Deloitte

Keynote - TUT W3C Web Technology Day: Linked Data for Science and Industry, 2...

What's the fuss about all this metadata?

An information environment for neuroscientists

Ensuring Continuing Access to Online Scholarly Resources

Digital Cultural Heritage and the new EU Framework Programme

Último

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Product Anonymous

A Principled Technologies deployment guide Conclusion Deploying VMware Cloud Foundation 5.1 on next gen Dell PowerEdge servers brings together critical virtualization capabilities and high-performing hardware infrastructure. Relying on our hands-on experience, this deployment guide offers a comprehensive roadmap that can guide your organization through the seamless integration of advanced VMware cloud solutions with the performance and reliability of Dell PowerEdge servers. In addition to the deployment efficiency, the Cloud Foundation 5.1 and PowerEdge solution delivered strong performance while running a MySQL database workload. By leveraging VMware Cloud Foundation 5.1 and PowerEdge servers, you could help your organization embrace cloud computing with confidence, potentially unlocking a new level of agility, scalability, and efficiency in your data center operations.

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...

Principled Technologies

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

Real Time Object Detection Using Open CV

Khem

The Good, the Bad and the Governed - Why is governance a dirty word? David O'Neill, Chief Operating Officer - APIContext Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...

apidays

As privacy and data protection regulations evolve rapidly, organizations operating in multiple jurisdictions face mounting challenges to ensure compliance and safeguard customer data. With state-specific privacy laws coming up in multiple states this year, it is essential to understand what their unique data protection regulations will require clearly. How will data privacy evolve in the US in 2024? How to stay compliant? Our panellists will guide you through the intricacies of these states' specific data privacy laws, clarifying complex legal frameworks and compliance requirements. This webinar will review: - The essential aspects of each state's privacy landscape and the latest updates - Common compliance challenges faced by organizations operating in multiple states and best practices to achieve regulatory adherence - Valuable insights into potential changes to existing regulations and prepare your organization for the evolving landscape

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

TrustArc

Axa Assurance Maroc - Insurer Innovation Award 2024

The Digital Insurer

AWS Community Day CPH - Three problems of Terraform

Andrey Devyatkin

Three things you will take away from the session: • How to run an effective tenant-to-tenant migration • Best practices for before, during, and after migration • Tips for using migration as a springboard to prepare for Copilot in Microsoft 365 Main ideas: Migration Overview: The presentation covers the current reality of cross-tenant migrations, the triggers, phases, best practices, and benefits of a successful tenant migration Considerations: When considering a migration, it is important to consider the migration scope, performance, customization, flexibility, user-friendly interface, automation, monitoring, support, training, scalability, data integrity, data security, cost, and licensing structure Next Wave: The next wave of change includes the launch of Copilot, which requires businesses to be prepared for upcoming changes related to Copilot and the cloud, and to consolidate data and tighten governance ShareGate: ShareGate can help with pre-migration analysis, configurable migration tool, and automated, end-user driven collaborative governance

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff

sammart93

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

💉💊+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHABI}}+971581248768 +971581248768 Mtp-Kit (500MG) Prices » Dubai [(+971581248768**)] Abortion Pills For Sale In Dubai, UAE, Mifepristone and Misoprostol Tablets Available In Dubai, UAE CONTACT DR.Maya Whatsapp +971581248768 We Have Abortion Pills / Cytotec Tablets /Mifegest Kit Available in Dubai, Sharjah, Abudhabi, Ajman, Alain, Fujairah, Ras Al Khaimah, Umm Al Quwain, UAE, Buy cytotec in Dubai +971581248768''''Abortion Pills near me DUBAI | ABU DHABI|UAE. Price of Misoprostol, Cytotec” +971581248768' Dr.DEEM ''BUY ABORTION PILLS MIFEGEST KIT, MISOPROTONE, CYTOTEC PILLS IN DUBAI, ABU DHABI,UAE'' Contact me now via What's App…… abortion Pills Cytotec also available Oman Qatar Doha Saudi Arabia Bahrain Above all, Cytotec Abortion Pills are Available In Dubai / UAE, you will be very happy to do abortion in Dubai we are providing cytotec 200mg abortion pill in Dubai, UAE. Medication abortion offers an alternative to Surgical Abortion for women in the early weeks of pregnancy. We only offer abortion pills from 1 week-6 Months. We then advise you to use surgery if its beyond 6 months. Our Abu Dhabi, Ajman, Al Ain, Dubai, Fujairah, Ras Al Khaimah (RAK), Sharjah, Umm Al Quwain (UAQ) United Arab Emirates Abortion Clinic provides the safest and most advanced techniques for providing non-surgical, medical and surgical abortion methods for early through late second trimester, including the Abortion By Pill Procedure (RU 486, Mifeprex, Mifepristone, early options French Abortion Pill), Tamoxifen, Methotrexate and Cytotec (Misoprostol). The Abu Dhabi, United Arab Emirates Abortion Clinic performs Same Day Abortion Procedure using medications that are taken on the first day of the office visit and will cause the abortion to occur generally within 4 to 6 hours (as early as 30 minutes) for patients who are 3 to 12 weeks pregnant. When Mifepristone and Misoprostol are used, 50% of patients complete in 4 to 6 hours; 75% to 80% in 12 hours; and 90% in 24 hours. We use a regimen that allows for completion without the need for surgery 99% of the time. All advanced second trimester and late term pregnancies at our Tampa clinic (17 to 24 weeks or greater) can be completed within 24 hours or less 99% of the time without the need surgery. The procedure is completed with minimal to no complications. Our Women's Health Center located in Abu Dhabi, United Arab Emirates, uses the latest medications for medical abortions (RU-486, Mifeprex, Mifegyne, Mifepristone, early options French abortion pill), Methotrexate and Cytotec (Misoprostol). The safety standards of our Abu Dhabi, United Arab Emirates Abortion Doctors remain unparalleled. They consistently maintain the lowest complication rates throughout the nation. Our Physicians and staff are always available to answer questions and care for women in one of the most difficult times in their lives. The decision to have an abortion at the Abortion Cl

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...

?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@

The value of a flexible API Management solution for Open Banking Steve Melan, Manager for IT Innovation and Architecture - State's and Saving's Bank of Luxembourg Apidays New York 2024: The API Economy in the AI Era (April 30 & May 1, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays New York 2024 - The value of a flexible API Management solution for O...

apidays

The presentation explores the development and application of artificial intelligence (AI) from its inception to its current status in the modern world. The term "artificial intelligence" was first coined by John McCarthy in 1956 to describe efforts to develop computer programs capable of performing tasks that typically require human intelligence. This concept was first introduced at a conference held at Dartmouth College, where programs demonstrated capabilities such as playing chess, proving theorems, and interpreting texts. In the early stages, Alan Turing contributed to the field by defining intelligence as the ability of a being to respond to certain questions intelligently, proposing what is now known as the Turing Test to evaluate the presence of intelligent behavior in machines. As the decades progressed, AI evolved significantly. The 1980s focused on machine learning, teaching computers to learn from data, leading to the development of models that could improve their performance based on their experiences. The 1990s and 2000s saw further advances in algorithms and computational power, which allowed for more sophisticated data analysis techniques, including data mining. By the 2010s, the proliferation of big data and the refinement of deep learning techniques enabled AI to become mainstream. Notable milestones included the success of Google's AlphaGo and advancements in autonomous vehicles by companies like Tesla and Waymo. A major theme of the presentation is the application of generative AI, which has been used for tasks such as natural language text generation, translation, and question answering. Generative AI uses large datasets to train models that can then produce new, coherent pieces of text or other media. The presentation also discusses the ethical implications and the need for regulation in AI, highlighting issues such as privacy, bias, and the potential for misuse. These concerns have prompted calls for comprehensive regulations to ensure the safe and equitable use of AI technologies. Artificial intelligence has also played a significant role in healthcare, particularly highlighted during the COVID-19 pandemic, where it was used in drug discovery, vaccine development, and analyzing the spread of the virus. The capabilities of AI in healthcare are vast, ranging from medical diagnostics to personalized medicine, demonstrating the technology's potential to revolutionize fields beyond just technical or consumer applications. In conclusion, AI continues to be a rapidly evolving field with significant implications for various aspects of society. The development from theoretical concepts to real-world applications illustrates both the potential benefits and the challenges that come with integrating advanced technologies into everyday life. The ongoing discussion about AI ethics and regulation underscores the importance of managing these technologies responsibly to maximize their their benefits while minimizing potential harms.

Artificial Intelligence: Facts and Myths

Joaquim Jorge

This presentation explores the impact of HTML injection attacks on web applications, detailing how attackers exploit vulnerabilities to inject malicious code into web pages. Learn about the potential consequences of such attacks and discover effective mitigation strategies to protect your web applications from HTML injection vulnerabilities. for more information visit https://bostoninstituteofanalytics.org/category/cyber-security-ethical-hacking/

HTML Injection Attacks: Impact and Mitigation Strategies

Boston Institute of Analytics

Strategies for Landing an Oracle DBA Job as a Fresher

Remote DBA Services

Partners Life - Insurer Innovation Award 2024

The Digital Insurer

Tata AIG General Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

A Domino Admins Adventures (Engage 2024)

Gabriella Davis

GenAI Risks & Security Meetup 01052024.pdf

lior mazor

Imagine a world where information flows as swiftly as thought itself, making decision-making as fluid as the data driving it. Every moment is critical, and the right tools can significantly boost your organization’s performance. The power of real-time data automation through FME can turn this vision into reality. Aimed at professionals eager to leverage real-time data for enhanced decision-making and efficiency, this webinar will cover the essentials of real-time data and its significance. We’ll explore: FME’s role in real-time event processing, from data intake and analysis to transformation and reporting An overview of leveraging streams vs. automations FME’s impact across various industries highlighted by real-life case studies Live demonstrations on setting up FME workflows for real-time data Practical advice on getting started, best practices, and tips for effective implementation Join us to enhance your skills in real-time data automation with FME, and take your operational capabilities to the next level.

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Safe Software

Speech Technology and Big Data

1. * Nick Campbell Speech Communication Lab Trinity College Dublin, Ireland

2. * * TCD – Stokes Professor (Dublin) * CNGL – PI – Delivery & Interaction * ELRA – board member / VP – speech * ISCA – board member – workshops * IEEE – Sig Proc Soc - SLTC member * ATR/NiCT – research director(Japan) * Speech Prosody 2014 (Dublin) host * Speech scientist/researcher/corpus analyst

3. * AT&T Bell Labs * The ideas people – think ‘BIG’ * IBM UK Scientific Centre * The corpus people – ‘collect it all’ * ATR basic telecom research * The fundamentals - learn how to ‘infer’ from it *

4. * we used to be considered BIG – speech data (and now multimedia) gobbled up memory * I collected 1500 hours of everyday chat/daily conversations in 2000 – (@1GB per minute) - took 5-years to process! * now Apple, Google, Ms, .. get that each minute (but the secret is in the metadata) * we need accessible data & tools for everybody! *

5. * but we need to manage privacy issues first! *

6. * and we need a way to protect IP as well * written publications have ISBN standard * work is now underway (cf ELRA & COCOSDA) to institute ISLRN for Language Resources * researchers need to get credit for corpora as well as for publishing research results * The community needs a way to identify, acknowledge, attribute, and reference data *

7. * tools for processing speech & multimodal data * htk, hts, R, etc . . . not simple to use * little consensus on what features to encode * manual bootstrap – much too time-consuming! *

8. * social interaction * personal idiosyncracies * group dynamics – multimodal data (TB/hr) * issues of robustness / domain specificity / privacy / storage & archiving / redistribution *

9. context analytics: * cultural and language-specific needs * multimodal – multimedia – multilingual * tools for ‘less-well-supported’ languages * e.g., U-STAR consortium for speech research – sharing tools & data & knowledge for research *

10. * European Language Resources Association * COCOSDA – int’l coordinating committee * IEEE SLTC, ISCA SIGS, there are places to go * but are they ready for really BIG data? perhaps not yet . . . *

11. * curricula prepare people * what standards to rely on? * what resources available? * what features to extract? * what tools to work with? * what use to put it to? * what info to hide? * what to do next? *

12. *

Speech Technology and Big Data

Recomendados

Recomendados

Mais conteúdo relacionado

Destaque

Destaque (11)

Semelhante a Speech Technology and Big Data

Semelhante a Speech Technology and Big Data (20)

Último

Último (20)

Speech Technology and Big Data