SlideShare uma empresa Scribd logo
1 de 21
PresentedPresented
ByBy
Sukhbeer KumarSukhbeer Kumar
SupervisedSupervised
ByBy
Rajan sir & praveen SirRajan sir & praveen Sir
Technical Seminar presentationTechnical Seminar presentation
onon
Voice morphingVoice morphing
1 SIT,MATHURA
SEMINAR OUTLINES
•What It is?
•Need of Voice Morphing
•Description the Morphing.
•Technical details of Morphing.
•Application areas.
SIT,MATHURA
2
Voice Morphing
• Transition Phenomenon.
• Technology developed at the Los Alamos
National Laboratory in New Mexico,
USA by George Papcun .
SIT,MATHURA
3
What it actually
performs ?
• It is a technique to modify a source
speaker's speech to sound as if it
was spoken by a target speaker.
• Voice morphing enables speech
patterns to be cloned
– And an accurate copy of a person's voice
can be made that can wishes to say,
anything in the voice of someone else.
SIT MATHURA4
Need of voice morphing
• Text To Speech (TTS)
• In public speech systems
• For special effects ( just like video
or image morphing is done ).
• To diminish Ethnical barriers.
SIT,MATHURA5
Voice Morphing Process
• Preprocessing or representation
conversion.
• Pitch and Envelope analysis.
• Morphing which includes Warping and
interpolation.
• Signal re-estimation.
SIT,MATHURA6
Block Diagram
SIT,MATHURA7
Pre-Processing
• Involves processes like signal
acquisition in discrete form and
windowing.
SIT,MATHURA8
Pitch And Envelope
Analysis
• This process will extract the pitch.
• Formant information in the speech
signal.
SIT,MATHURA9
Conversion
SIT ,MATHURA10
Matching and Warping
• DTW(Dynamic Time Warping)
- Dynamic Time Warping (DTW) is used to
find the best match between the pitch of the
two sounds.
SIT,MATHURA11
Signal Re-Estimation
• Loss during Signal re-estimation
- Due to signals being transformation into the
cepstral domain, a magnitude function is used.
This results in a loss of phase information in
the representation of the data.
SIT,MATHURA12
Summarized Block
Diagram
SIT,MATHURA13
Limitations
 
•Lots of normalizing problems.
•Some applications require extensive
sound libraries.
•Different languages require different
phonetics.
•It is very seldom complete.
SIT,MATHURA14
Advantages
• Allows speech model to be duplicated
and an exact copy of a person’s voice.
• Powerful combat zone weapon.
SIT,MATHURA15
Disadvantages
• Use to pull out the useful
information.
• It hides the actual identity of the
user.
SIT,MATHURA16
Conclusion
• The approach we have adopted
separates the sounds into two forms:
- Spectral envelope information
- Pitch and voicing information.
• Dynamic Time Warping
- Aligns the sounds with respect to their
pitches.
• Signal re-estimation algorithm.
- Frames are converted back into a time domain
waveform. SIT,MATHURA17
Application Areas
• Fake telephone conversations as
evidence in courts of law.
• Powerful battlefield weapon.
- Provide fake orders to the enemy's troops,
appearing to come from their own commanders.
SIT,MATHURA18
Future Scope
• Extending the functionality of tool.
- Create a powerful and flexible morphing
tool.
• Increased user interaction.
- Graphical User Interface could be designed
and integrated to make the package more
‘user-friendly’.
SIT,MATHURA19
Thank you!!!
SIT,MATHURA20
Questions??
SIT,MATHURA21

Mais conteúdo relacionado

Mais procurados

Voice Morphing
Voice MorphingVoice Morphing
Voice Morphing
Sayyed Z
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
International Journal of Science and Research (IJSR)
 
silent sound technology
silent sound technologysilent sound technology
silent sound technology
Najeeb p
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
Diptimaya Sarangi
 

Mais procurados (20)

Voice Morphing
Voice MorphingVoice Morphing
Voice Morphing
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
silent sound technology
silent sound technologysilent sound technology
silent sound technology
 
A seminar report on speech recognition technology
A seminar report on speech recognition technologyA seminar report on speech recognition technology
A seminar report on speech recognition technology
 
Speech Recognition System By Matlab
Speech Recognition System By MatlabSpeech Recognition System By Matlab
Speech Recognition System By Matlab
 
Silent Sound Technology
Silent Sound TechnologySilent Sound Technology
Silent Sound Technology
 
Speech Recognition in Artificail Inteligence
Speech Recognition in Artificail InteligenceSpeech Recognition in Artificail Inteligence
Speech Recognition in Artificail Inteligence
 
Silent sound technology NEW
Silent sound technology NEW Silent sound technology NEW
Silent sound technology NEW
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Unit 1 speech processing
Unit 1 speech processingUnit 1 speech processing
Unit 1 speech processing
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
silent sound technology
silent sound technologysilent sound technology
silent sound technology
 
Voice recognition system
Voice recognition systemVoice recognition system
Voice recognition system
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Silent sound technology
Silent sound technologySilent sound technology
Silent sound technology
 
Introduction to text to speech
Introduction to text to speechIntroduction to text to speech
Introduction to text to speech
 
silent sound technology final report(17321A0432) (1).pdf
silent sound technology final report(17321A0432) (1).pdfsilent sound technology final report(17321A0432) (1).pdf
silent sound technology final report(17321A0432) (1).pdf
 
Speech recognition system seminar
Speech recognition system seminarSpeech recognition system seminar
Speech recognition system seminar
 

Destaque

All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11
Jeff Vanderpool
 
March 3 2004 for the ai cie
March 3 2004 for the ai cieMarch 3 2004 for the ai cie
March 3 2004 for the ai cie
Shailesh Dubey
 

Destaque (18)

Sniffer ppt
Sniffer pptSniffer ppt
Sniffer ppt
 
Project Oxygen
Project OxygenProject Oxygen
Project Oxygen
 
NIGHT VISION TECHNOLOGY
NIGHT VISION TECHNOLOGYNIGHT VISION TECHNOLOGY
NIGHT VISION TECHNOLOGY
 
Chip morphing
Chip morphingChip morphing
Chip morphing
 
Polymer memory
Polymer memoryPolymer memory
Polymer memory
 
Build Features, Not Apps
Build Features, Not AppsBuild Features, Not Apps
Build Features, Not Apps
 
White led
White ledWhite led
White led
 
Brain Fingerprinting PPT
Brain Fingerprinting PPTBrain Fingerprinting PPT
Brain Fingerprinting PPT
 
Bubble Power
Bubble PowerBubble Power
Bubble Power
 
Seminar on night vision technology ppt
Seminar on night vision technology pptSeminar on night vision technology ppt
Seminar on night vision technology ppt
 
White led
White ledWhite led
White led
 
DINAMIKA Pitch Deck
DINAMIKA Pitch DeckDINAMIKA Pitch Deck
DINAMIKA Pitch Deck
 
Dinamika Islam Turki dan Indonesia
Dinamika Islam Turki dan IndonesiaDinamika Islam Turki dan Indonesia
Dinamika Islam Turki dan Indonesia
 
All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11All In One Olathe - revised 10-24-11
All In One Olathe - revised 10-24-11
 
How a Robot Fight Works
How a Robot Fight WorksHow a Robot Fight Works
How a Robot Fight Works
 
March 3 2004 for the ai cie
March 3 2004 for the ai cieMarch 3 2004 for the ai cie
March 3 2004 for the ai cie
 
Dinamika Perkembangan K13
Dinamika Perkembangan K13Dinamika Perkembangan K13
Dinamika Perkembangan K13
 
Airborn internet
Airborn internetAirborn internet
Airborn internet
 

Semelhante a Voice morphing

How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
Muhammad Taqi
 
A seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORM
A seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORMA seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORM
A seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORM
मनीष राठौर
 

Semelhante a Voice morphing (20)

voice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdfvoice-morphing-101113123852-phpapp011-151211104638.pdf
voice-morphing-101113123852-phpapp011-151211104638.pdf
 
IRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time DomainIRJET- Pitch Detection Algorithms in Time Domain
IRJET- Pitch Detection Algorithms in Time Domain
 
Digital signal processing
Digital signal processingDigital signal processing
Digital signal processing
 
Rayo for XMPP Folks
Rayo for XMPP FolksRayo for XMPP Folks
Rayo for XMPP Folks
 
final ppt BATCH 3.pptx
final ppt BATCH 3.pptxfinal ppt BATCH 3.pptx
final ppt BATCH 3.pptx
 
“Comparing ML-Based Audio with ML-Based Vision: An Introduction to ML Audio f...
“Comparing ML-Based Audio with ML-Based Vision: An Introduction to ML Audio f...“Comparing ML-Based Audio with ML-Based Vision: An Introduction to ML Audio f...
“Comparing ML-Based Audio with ML-Based Vision: An Introduction to ML Audio f...
 
How speech reorganization works
How speech reorganization worksHow speech reorganization works
How speech reorganization works
 
Voice recognition security systems
Voice recognition security systemsVoice recognition security systems
Voice recognition security systems
 
Multimedia seminar ppt
Multimedia seminar pptMultimedia seminar ppt
Multimedia seminar ppt
 
Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01Speechrecognition 100423091251-phpapp01
Speechrecognition 100423091251-phpapp01
 
Sequence to sequence model speech recognition
Sequence to sequence model speech recognitionSequence to sequence model speech recognition
Sequence to sequence model speech recognition
 
Identification of frequency domain using quantum based optimization neural ne...
Identification of frequency domain using quantum based optimization neural ne...Identification of frequency domain using quantum based optimization neural ne...
Identification of frequency domain using quantum based optimization neural ne...
 
A seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORM
A seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORMA seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORM
A seminar on INTRODUCTION TO MULTI-RESOLUTION AND WAVELET TRANSFORM
 
Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approach
 
Mini Project- Audio Enhancement
Mini Project-  Audio EnhancementMini Project-  Audio Enhancement
Mini Project- Audio Enhancement
 
Introduction to DSP
Introduction to DSPIntroduction to DSP
Introduction to DSP
 
Speech based password authentication system on FPGA
Speech based password authentication system on FPGASpeech based password authentication system on FPGA
Speech based password authentication system on FPGA
 
Petralex Speech Communication (EN)
Petralex Speech Communication (EN)Petralex Speech Communication (EN)
Petralex Speech Communication (EN)
 
A Study on the Video Scene Retrieving System
A Study on the Video Scene Retrieving SystemA Study on the Video Scene Retrieving System
A Study on the Video Scene Retrieving System
 
Listening at the Cocktail Party with Deep Neural Networks and TensorFlow
Listening at the Cocktail Party with Deep Neural Networks and TensorFlowListening at the Cocktail Party with Deep Neural Networks and TensorFlow
Listening at the Cocktail Party with Deep Neural Networks and TensorFlow
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 

Voice morphing

  • 1. PresentedPresented ByBy Sukhbeer KumarSukhbeer Kumar SupervisedSupervised ByBy Rajan sir & praveen SirRajan sir & praveen Sir Technical Seminar presentationTechnical Seminar presentation onon Voice morphingVoice morphing 1 SIT,MATHURA
  • 2. SEMINAR OUTLINES •What It is? •Need of Voice Morphing •Description the Morphing. •Technical details of Morphing. •Application areas. SIT,MATHURA 2
  • 3. Voice Morphing • Transition Phenomenon. • Technology developed at the Los Alamos National Laboratory in New Mexico, USA by George Papcun . SIT,MATHURA 3
  • 4. What it actually performs ? • It is a technique to modify a source speaker's speech to sound as if it was spoken by a target speaker. • Voice morphing enables speech patterns to be cloned – And an accurate copy of a person's voice can be made that can wishes to say, anything in the voice of someone else. SIT MATHURA4
  • 5. Need of voice morphing • Text To Speech (TTS) • In public speech systems • For special effects ( just like video or image morphing is done ). • To diminish Ethnical barriers. SIT,MATHURA5
  • 6. Voice Morphing Process • Preprocessing or representation conversion. • Pitch and Envelope analysis. • Morphing which includes Warping and interpolation. • Signal re-estimation. SIT,MATHURA6
  • 8. Pre-Processing • Involves processes like signal acquisition in discrete form and windowing. SIT,MATHURA8
  • 9. Pitch And Envelope Analysis • This process will extract the pitch. • Formant information in the speech signal. SIT,MATHURA9
  • 11. Matching and Warping • DTW(Dynamic Time Warping) - Dynamic Time Warping (DTW) is used to find the best match between the pitch of the two sounds. SIT,MATHURA11
  • 12. Signal Re-Estimation • Loss during Signal re-estimation - Due to signals being transformation into the cepstral domain, a magnitude function is used. This results in a loss of phase information in the representation of the data. SIT,MATHURA12
  • 14. Limitations   •Lots of normalizing problems. •Some applications require extensive sound libraries. •Different languages require different phonetics. •It is very seldom complete. SIT,MATHURA14
  • 15. Advantages • Allows speech model to be duplicated and an exact copy of a person’s voice. • Powerful combat zone weapon. SIT,MATHURA15
  • 16. Disadvantages • Use to pull out the useful information. • It hides the actual identity of the user. SIT,MATHURA16
  • 17. Conclusion • The approach we have adopted separates the sounds into two forms: - Spectral envelope information - Pitch and voicing information. • Dynamic Time Warping - Aligns the sounds with respect to their pitches. • Signal re-estimation algorithm. - Frames are converted back into a time domain waveform. SIT,MATHURA17
  • 18. Application Areas • Fake telephone conversations as evidence in courts of law. • Powerful battlefield weapon. - Provide fake orders to the enemy's troops, appearing to come from their own commanders. SIT,MATHURA18
  • 19. Future Scope • Extending the functionality of tool. - Create a powerful and flexible morphing tool. • Increased user interaction. - Graphical User Interface could be designed and integrated to make the package more ‘user-friendly’. SIT,MATHURA19