Dacco and qdacco
An Open Source Catalan-English Dictionary
and its GUI
Fosdem 2008 in Brussels, Belgium
English-Catalan
Catalan-English
Carles Pina i Estany
carles@pina.cat
2/21
What is the history of Catalan and
what is its future?
● It's a Romance language that derives from
Latin, just like Spanish, Italian, French,
Portuguese and Romanian
● The Catalan linguistic and cultural community
has its own .cat Top Level Domain
● It's an official language of Catalonia, Andorra,
Valencia and the Balearic Islands. It is also
spoken in Alghero (Sardinia) and parts of
southern France
3/21
Map of Europe
Author: Wikipedia
4/21
Where is it spoken?
Author: Josep Gallart
5/21
Speakers as % of EU population
[Bar chart: Catalan, Portuguese, Dutch, Spanish; vertical axis 0 to 16 %; chart label: 9.1 million]
Data source: http://ec.europa.eu/public_opinion/archives/ebs/ebs_243_en.pdf
6/21
What is Dacco?
● English-Catalan dictionary tailored for speakers
of both languages
● All entries are written in XML
– Also exported to PDF for improved usability
– Form-based web search at www.catalandictionary.org
– Plugins available for Firefox and IE
– Gadget for iGoogle homepage
– Catalan verb conjugator
– qdacco (standalone application)
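Because the entries live in XML, any tool can load them with a standard parser. The sketch below shows the idea in Python. The element names (`dictionary`, `entry`, `headword`, `translation`, `example`) and the sample entries are assumptions made for illustration only; the real Dacco schema may well differ.

```python
import xml.etree.ElementTree as ET

# Hypothetical entry layout, invented for this sketch.
# The real Dacco XML files may use different element names.
SAMPLE = """
<dictionary>
  <entry>
    <headword>poma</headword>
    <translation>apple</translation>
    <example>Menjo una poma cada dia.</example>
  </entry>
  <entry>
    <headword>gos</headword>
    <translation>dog</translation>
  </entry>
</dictionary>
"""


def load_entries(xml_text):
    """Return a dict mapping each headword to its list of translations."""
    root = ET.fromstring(xml_text.strip())
    entries = {}
    for entry in root.iter("entry"):
        headword = entry.findtext("headword")
        translations = [t.text for t in entry.findall("translation")]
        # Merge translations if the same headword appears more than once.
        entries.setdefault(headword, []).extend(translations)
    return entries


entries = load_entries(SAMPLE)
print(entries)
```

The same parsed structure could feed a PDF export, a web search form or a desktop client, which is one reason to keep a single XML source.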
7/21
Dacco's history
● An English student of Catalan found that there
was a lack of Catalan dictionaries available
which were comprehensive and aimed at
English speakers
● This was confirmed by other students of
Catalan through a survey she distributed during
her PhD studies
● The collection of dictionary entries was begun
in 2001. The first version of Dacco was
released in 2003
8/21
Who contributes?
● People from all walks of life: linguists but also
mathematicians, geographers, IT professionals.
Lou Hevly
Josep M. López
James Macgill
Linda Oxnard
Oriol Vilaseca
Others:
•Joanathan Kaye
•Gill Martin
•David Gimeno
•Jaume Ortolà
•Max Wheeler
•Margarita Castañón
Leopold Palomo
9/21
The 'average' contributor
● Aged between 20 and 80
● English speaker living in UK, US, Canada or
Australia
● Catalan speaker from Barcelona, Girona,
Tarragona, Valencia or the Balearic Islands
● Common purpose: wanting to create a resource
which is not only useful to language learners
but is also free and open
10/21
Where do we get our entries from?
● We do not copy entries from any dictionary
● Agreement with TermCat to incorporate their
open source vocabulary lists into our dictionary
● Entries are often added by those who use the
dictionary in their daily tasks when they search
for a word and fail to find it
● Recently: we are matching a Catalan-Latin-English
list of bird names to incorporate into the dictionary
11/21
How does the project work?
● First step: find a new word
– Mailing list: regular contributors send suggestions
– Web form: any user can send suggestions
– 'Missing word lists': created automatically from
online search engine and manually from
suggestions sent through qdacco
● Second Step: discuss the new word's
translation equivalent
– Different meanings according to geographic area,
age, socioeconomic status, etc.
● Third Step: project admin adds word to the XML
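The automatic part of the 'missing word lists' can be sketched as a count of searches that found nothing. This is only an illustration in Python: the function name `missing_words`, the `min_count` threshold and the sample search log are hypothetical, and the project's own scripts are not shown in these slides.

```python
from collections import Counter


def missing_words(searches, known_words, min_count=2):
    """Return (word, count) pairs for searched words absent from the
    dictionary, most frequently searched first."""
    # Normalise case and surrounding whitespace before counting.
    counts = Counter(w.strip().lower() for w in searches)
    missing = [
        (word, n)
        for word, n in counts.items()
        if word and word not in known_words and n >= min_count
    ]
    # Sort by descending count, then alphabetically for a stable report.
    return sorted(missing, key=lambda item: (-item[1], item[0]))


# Hypothetical data: words already in the dictionary and a search log.
known = {"poma", "gos"}
log = ["poma", "ensaimada", "Ensaimada", "castell", "gos", "castell", "ensaimada "]
report = missing_words(log, known)
print(report)
```

A list like this only proposes candidates; the discussion and the final edit of the XML remain manual steps, as described above.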
12/21
Dacco's Special Features (1/3)
● It's free: anyone can modify, copy or redistribute
– Data (XML files): LGPL
– PDF files: CC Attribution-Share Alike 2.5
– qdacco: GPL 3
● More than 15,000 entries on each side of the
dictionary (over 200 DIN-A4 pages)
13/21
Dacco's Special Features (2/3)
● 4 dictionaries:
– Catalan-English dictionary for Catalan speakers
– English-Catalan dictionary for Catalan speakers
– Catalan-English dictionary for English speakers
– English-Catalan dictionary for English speakers
● Examples and usage notes tailored according
to user's native language
14/21
Dacco's Special Features (3/3)
● Links to images
● Usage notes
● Examples
● Word frequency counts (from Google)
● Semantic fields: apple ⇒ fruit ⇒ food
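A semantic-field chain such as apple ⇒ fruit ⇒ food can be modelled as a mapping from each term to its broader field. The Python sketch below is an assumption about one possible representation; the `BROADER` table and the `semantic_chain` function are hypothetical and do not describe how Dacco stores this information.

```python
# Hypothetical parent links: each term points to its broader semantic field.
BROADER = {"apple": "fruit", "fruit": "food"}


def semantic_chain(word, broader=BROADER):
    """Follow the broader-field links from a word up to the top-level field."""
    chain = [word]
    seen = {word}
    while chain[-1] in broader:
        parent = broader[chain[-1]]
        if parent in seen:
            # Guard against accidental cycles in the data.
            break
        chain.append(parent)
        seen.add(parent)
    return chain


print(" => ".join(semantic_chain("apple")))
```

Storing only the immediate broader field keeps each entry small, and the full chain can be rebuilt on demand.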
15/21
Why is it so important that the
dictionary be open source?
● Culture should be free
● Anybody can incorporate it into their own
application or web site
● Anybody can suggest entries, though every
suggestion is examined and discussed by the team
of contributors so that the quality of the dictionary is
never compromised. It is a descriptive, not normative,
dictionary.
● We encourage others to create their own open
source dictionaries and are happy to share the
benefit of our experience
16/21
What is qdacco?
● Multi-platform (Unix, Linux, Windows*)
standalone application
● Available in standard Debian repositories
● qdacco has been developed since 2005
– Approximately two major releases a year
17/21
Screenshots
18/21
Why did we create qdacco?
● We wanted a reference application for Dacco
● Access to all Dacco resources (photos, links,
examples, notes, etc.)
● Reports missing words to the Dacco Project.
● Integration with Festival (Speech synthesizer)
● Auto-completion and many other features
● Offline searching - faster than looking through a
PDF file on your own.
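Auto-completion over an offline word list is commonly done with a binary search on a sorted list of headwords. The Python sketch below illustrates that general technique; it is not taken from qdacco, and the `complete` function, its `limit` parameter and the sample words are hypothetical.

```python
from bisect import bisect_left


def complete(prefix, sorted_words, limit=5):
    """Return up to `limit` words from a sorted list that start with `prefix`."""
    # Binary search finds the first position where the prefix could occur.
    start = bisect_left(sorted_words, prefix)
    matches = []
    for word in sorted_words[start:]:
        if not word.startswith(prefix):
            break  # Sorted order: no later word can match either.
        matches.append(word)
        if len(matches) == limit:
            break
    return matches


# Hypothetical headword list, sorted once when the data is loaded.
WORDS = sorted(["casa", "castell", "català", "gos", "poma", "pont"])
print(complete("cas", WORDS))
print(complete("po", WORDS))
```

Because the lookup jumps straight to the matching region, it stays fast even for tens of thousands of entries, which is the kind of advantage over scanning a PDF that the last bullet points to.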
19/21
qdacco architecture
[Diagram components: libqdacco; qdacco and textdacco; XML; Internet (send suggestions); Festival]
20/21
Similar projects
● GPL German-Catalan dictionary:
– GPL Deutsch-Katalanisches Wörterbuch
– http://www.aldeaglobal.net/diccionari/index.php
● Wiktionary:
– Catalan: http://ca.wiktionary.org/wiki/Portada
– English: http://en.wiktionary.org/wiki/Main_Page
21/21
Thanks for your attention
Questions?
Mail: carles@pina.cat
http://www.catalandictionary.org
22/21
Creative Commons
This work is licensed under the Creative
Commons Attribution 2.5 Spain License. To
view a copy of this license, visit
http://creativecommons.org/licenses/by/2.5/es/
or send a letter to Creative Commons, 171
Second Street, Suite 300, San Francisco,
California, 94105, USA.


Editor's Notes

  1. Don't speak (except Carles)
  2. German case
  3. Version for Catalan and English; "castell" difference
  4. Everyone from the "who we are" page. Non-academic people = Carles and I
  5. Quickly mention the first points, to relate them to the photos behind
  6. No, except TermCat, because of its free license
  7. Carles: why LGPL, why Creative Commons
  8. Ensaimada example (a picture is worth a thousand words)
  9. as beer should be
  10. libqdacco, textdacco, qdacco, new architecture 0.7, Windows*
  11. oxford genie
  12. German culture again