Speech to text (Speech Recognition API and PyAudio library)

250x250

Notice

Recent Posts

Recent Comments

Link

« 2025/04 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

Tags more

Archives

Today

Total

관리 메뉴

데이터과학 삼학년

Speech to text (Speech Recognition API and PyAudio library) 본문

Machine Learning

Speech to text (Speech Recognition API and PyAudio library)

Dan-k 2020. 6. 18. 16:59

음성을 문자로 변환하는 api를 소개한다.

보통 음성은 대표적인 커뮤니케이션 수단이지만, 분석을 할 때는 제약이 있다.

이에 음성을 텍스트로 변환하는 방법에 대해 알아보고자 한다.

(Hidden Markov Model (HMM), deep neural network models are used to convert the audio into text.)

Hidden Markov Model 을 이용하여 보통 음성을 텍스트로 변환한다.

대표적인 speech to text api로 Speech Recognition api와 pyaudio를 소개하려 한다.

Speech Recognition

Speech Recognition api 는 여러개의 api가 있는데 konlpy처럼... 여기서는 Google에서 제공해주는 api를 사용한다.

지원 언어가 매우 다양하다 (한국어도 물론 포함).

설치

!pip install SpeechRecognition

변환 코드

#import library
import speech_recognition as sr

# Initialize recognizer class (for recognizing the speech)
r = sr.Recognizer()

# Reading Audio file as source
# listening the audio file and store in audio_text variable

with sr.AudioFile('I-dont-know.wav') as source:
    
    audio_text = r.listen(source)
    
# recoginize_() method will throw a request error if the API is unreachable, hence using exception handling
    try:
        
        # using google speech recognition
        text = r.recognize_google(audio_text)
        print('Converting audio transcripts into text ...')
        print(text)
     
    except:
         print('Sorry.. run again...')

#import library

import speech_recognition as sr

# Initialize recognizer class (for recognizing the speech)

r = sr.Recognizer()

# Reading Microphone as source
# listening the speech and store in audio_text variable

with sr.Microphone() as source:
    print("Talk")
    audio_text = r.listen(source)
    print("Time over, thanks")
# recoginize_() method will throw a request error if the API is unreachable, hence using exception handling
    
    try:
        # using google speech recognition
        print("Text: "+r.recognize_google(audio_text))
    except:
         print("Sorry, I did not get that")

https://cloud.google.com/speech-to-text/docs/languages

언어 지원 | Cloud Speech-to-Text 문서 | Google Cloud

이 페이지에는 Cloud Speech-to-Text에서 지원하는 모든 언어가 나와 있습니다. 언어는 인식 요청의 languageCode 매개변수에서 지정됩니다. 언어별로 사용 가능한 클래스 토큰에 대한 자세한 내용은 클��

cloud.google.com

728x90

LIST

'Machine Learning' 카테고리의 다른 글

Text classification using GCP ai-platform (0)	2020.06.26
tf.keras.callbacks.LearningRateScheduler (1)	2020.06.24
Sequence Model (RNN, LSTM) (1)	2020.06.02
Going Faster and Deeper (0)	2020.06.02
Dealing with Data Scarcity (0)	2020.06.02

'Machine Learning' Related Articles

Comments

데이터과학 삼학년 Data scientist 의 소소한 개발(계발) 일기

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

데이터과학 삼학년

데이터과학 삼학년

Speech to text (Speech Recognition API and PyAudio library) 본문

Speech to text (Speech Recognition API and PyAudio library)

'Machine Learning' 카테고리의 다른 글

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역