KR102137155B1

KR102137155B1 - Telecommunication service system and method using speech recognition technology

Info

Publication number: KR102137155B1
Application number: KR1020180143704A
Authority: KR
Inventors: 이상엽
Original assignee: 이상엽
Priority date: 2018-11-20
Filing date: 2018-11-20
Publication date: 2020-07-23
Anticipated expiration: 2038-11-20
Also published as: KR20200058958A

Abstract

본 발명에 따른 음성인식을 이용한 통화 서비스 시스템은, 서비스 이용자를 위한 서비스 이용장치와, 서비스 이용장치에 통화 서비스를 제공하기 위해 네트워크를 통해 서비스 이용장치와 통신하는 서비스 서버를 포함한다. 서비스 이용장치는, 마이크와, 스피커와, 디스플레이부와, 네트워크를 통해 서비스 서버와 통신하는 통신부와, 제어부를 구비한다. 서비스 서버는, 네트워크를 통해 서비스 이용장치와 통신하는 통신부와, 서비스 이용장치로부터 수신하는 음성을 텍스트로 변환하는 STT 변환부와, STT 변환부에 의해 변환된 텍스트를 저장하는 저장부와, 저장부에 저장된 텍스트를 분석하는 TA 분석부와, TA 분석부가 분석한 결과를 기록하는 분석 결과 DB와, 제어부를 구비한다. STT 변환부에 의해 변환되는 텍스트는 서비스 이용장치에 제공되어 디스플레이부에 실시간으로 디스플레이된다.The call service system using voice recognition according to the present invention includes a service using device for a service user, and a service server communicating with the service using device through a network to provide a call service to the service using device. The service using apparatus includes a microphone, a speaker, a display unit, a communication unit communicating with a service server via a network, and a control unit. The service server includes a communication unit communicating with a service using device through a network, an STT converting unit converting voice received from the service using device into text, a storage unit storing text converted by the STT converting unit, and a storage unit It has a TA analysis unit for analyzing the text stored in the, analysis result DB for recording the analysis results of the TA analysis unit, and a control unit. The text converted by the STT conversion unit is provided to the service using device and displayed in real time on the display unit.

Description

Telephony service system and method using voice recognition {TELECOMMUNICATION SERVICE SYSTEM AND METHOD USING SPEECH RECOGNITION TECHNOLOGY}

본 발명은 통화 서비스 시스템에 관한 것으로, 더욱 상세하게는 통화 중 이용자의 음성을 인식하여 텍스트로 변환하고 변환된 텍스트를 이용자에게 실시간으로 제공하거나 분석하여 다양한 부가 서비스를 제공할 수 있는 음성인식을 이용한 통화 서비스 시스템 및 방법에 관한 것이다.The present invention relates to a call service system, and more specifically, recognizes a user's voice during a call, converts it into text, and provides or analyzes the converted text in real time to the user using voice recognition to provide various additional services. It relates to a call service system and method.

최근 정보 통신 기술의 발달로 스마트폰, 태블릿 PC, 스마트 가전과 같이 통신 기능이 부여된 다양한 통신 단말기의 보급이 대중화되었다.2. Description of the Related Art With the recent development of information and communication technology, the spread of various communication terminals with communication functions such as smartphones, tablet PCs, and smart home appliances has become popular.

사용자는 통신 단말기를 이용하여 장소와 시간의 구애 없이 타인과 통화하거나 메시지를 송수신하고, 인터넷에 접속하여 원하는 정보를 검색할 수 있으며, 통신 단말기에 탑재된 위치 센서를 이용하여 타인과 위치를 공유하는 등의 다양한 서비스를 제공받을 수 있다.A user can use a communication terminal to call or send messages to or receive messages from other people regardless of place and time, access the Internet to search for desired information, and share the location with others using the location sensor mounted on the communication terminal. Various services such as can be provided.

초기의 통신 단말기는 하드웨어 성능의 제약으로 인해 다수의 애플리케이션을 동시에 수행하는 것에 한계가 있었으나, 최근 통신 단말기의 하드웨어 성능이 발달하여 정보 처리 속도가 향상되고 메모리의 용량이 커짐에 따라 휴대 단말기에 설치된 다양한 애플리케이션을 동시에 실행할 수 있는 멀티태스킹이 가능하게 되었다.In the early days, communication terminals were limited in performing multiple applications at the same time due to limitations in hardware performance, but recently, as hardware performance of communication terminals has improved, information processing speed has increased and memory capacity has increased. Multitasking, which allows applications to run concurrently, is now possible.

이러한 통신 단말기는 통신 기술의 발달 및 사용자들의 사용이 증가함에 따라 전통적인 음성 통화 서비스 및 문자 서비스에서 벗어나 사용자의 욕구에 맞게 보다 다양한 기능을 구비하여 사용자의 편의를 제공하고 있다. 예를 들어, 통신 단말기를 통해 일대일 통화뿐만 아니라 복수의 이용자 간에 통화가 가능한 컨퍼런스 콜 서비스가 다양한 형태로 서비스되고 있다.These communication terminals provide convenience for users by providing more various functions according to the needs of users, away from the traditional voice call service and text service as the development of communication technology and the use of users increase. For example, a conference call service capable of making a one-to-one call through a communication terminal as well as a call between a plurality of users has been provided in various forms.

컨퍼런스 콜 서비스는 복수의 유무선 통신 단말기의 사용자 간에 이루어지는 통화 서비스를 일컫는다. 이러한 통화 서비스는 서로 다른 지역에 있는 복수의 이용자들이 동일한 장소에 모이지 않고서도 유무선 통신 단말기를 통해 회의나 모임을 진행 할 수 있기 때문에 시간과 비용을 절약할 수 있는 이점이 있다.Conference call service refers to a call service between users of a plurality of wired and wireless communication terminals. Such a call service has an advantage of saving time and money because a plurality of users in different regions can conduct a meeting or a meeting through a wired or wireless communication terminal without having to gather in the same place.

일반적인 통신 단말기를 이용하여 음성 통화나 영상 통화로 컨퍼런스 콜 서비스를 이용하는 경우에는 대화자가 주고 받은 통화 내용이 단순히 서로에게 전달되기만 할 뿐, 별도로 이를 기록하는 방법이 제공되지 않는다. 따라서, 통화 내용을 기록으로 보관하기 위해서는 통화 중 필요한 정보를 사용자가 수기로 기록하거나, 녹음해야 하는 불편함이 있다.When a conference call service is used for a voice call or a video call using a general communication terminal, the contents of a call exchanged by the talker are simply transmitted to each other, and a method for recording them separately is not provided. Therefore, in order to keep the contents of the call as a record, there is a inconvenience in that the user needs to manually record or record the necessary information during the call.

공개특허공보 제2013-0092369호 (2013. 08. 20)Published Patent Publication No. 2013-0092369 (2013. 08. 20)

본 발명은 상술한 바와 같은 점을 감안하여 안출된 것으로, 통화 중 이용자의 음성을 인식하여 텍스트로 변환 및 저장하고, 변환된 텍스트를 이용자에게 실시간으로 제공하거나 분석하여 연관 정보를 제공하고, 저장 정보를 이용하여 새로운 수익 모델을 창출할 수 있는 음성인식을 이용한 통화 서비스 시스템 및 방법을 제공하는 것을 목적으로 한다.The present invention has been devised in view of the above-mentioned points, and recognizes a user's voice during a call, converts and stores the text, and provides or analyzes the converted text in real time to provide relevant information, and stores information It is an object to provide a call service system and method using voice recognition that can create a new revenue model using.

상술한 바와 같은 목적을 해결하기 위한 본 발명에 따른 음성인식을 이용한 통화 서비스 시스템은, 서비스 이용자를 위한 서비스 이용장치; 및 상기 서비스 이용장치에 통화 서비스를 제공하기 위해 네트워크를 통해 상기 서비스 이용장치와 통신하는 서비스 서버;를 포함하고, 상기 서비스 이용장치는, 마이크와, 스피커와, 디스플레이부와, 상기 네트워크를 통해 상기 서비스 서버와 통신하는 통신부와, 상기 마이크와, 상기 스피커와, 상기 디스플레이부와, 상기 통신부와 전기적으로 연결되는 제어부를 구비하고, 상기 서비스 서버는, 상기 네트워크를 통해 상기 서비스 이용장치와 통신하는 통신부와, 상기 서비스 이용장치로부터 수신하는 음성을 텍스트로 변환하는 STT 변환부와, 상기 STT 변환부에 의해 변환된 텍스트를 저장하는 저장부와, 상기 저장부에 저장된 텍스트를 분석하는 TA 분석부와, 상기 TA 분석부가 분석한 결과를 기록하는 분석 결과 DB와, 상기 STT 변환부와, 상기 저장부와, 상기 TA 분석기와, 상기 분석 결과 DB와 전기적으로 연결되는 제어부를 구비하며, 상기 STT 변환부에 의해 변환된 텍스트가 상기 서비스 이용장치에 제공되어 상기 디스플레이부에 실시간으로 디스플레이되는 것을 특징으로 한다.A call service system using voice recognition according to the present invention for solving the above object includes: a service using device for a service user; And a service server communicating with the service using device through a network to provide a call service to the service using device, wherein the service using device includes: a microphone, a speaker, a display unit, and the network. A communication unit for communicating with a service server, a microphone, the speaker, the display unit, and a control unit electrically connected to the communication unit, the service server, the communication unit for communicating with the service using device through the network And, STT conversion unit for converting the voice received from the service using device to text, a storage unit for storing the text converted by the STT conversion unit, a TA analysis unit for analyzing the text stored in the storage unit, It has an analysis result DB for recording the analysis result of the TA analysis unit, the STT conversion unit, the storage unit, the TA analyzer, and a control unit electrically connected to the analysis result DB, the STT conversion unit Characterized in that the converted text is provided to the service using device and displayed in real time on the display unit.

상기 서비스 서버는, 상기 분석 결과 DB에 저장된 데이터로부터 특정 질문과 이에 매칭되는 예상 답변을 생성하여 데이터베이스화 하는 질문 답변 생성부를 포함할 수 있다.The service server may include a question answer generator for generating a database by generating a specific question and an expected answer matching it from data stored in the analysis result DB.

상기 서비스 서버는, 상기 STT 변환부에 의해 변환되는 텍스트 중에 상기 질문 답변 생성부에 저장된 질문에 대응하는 텍스트가 생성되는 경우, 상기 질문 답변 생성부에 저장된 예상 답변 중 해당 질문에 매칭되는 예상 답변을 상기 서비스 이용장치에 제공하여 상기 디스플레이부에 디스플레이되도록 할 수 있다.When the text corresponding to the question stored in the question answer generator is generated among the text converted by the STT converter, the service server provides an expected answer matching the question among the expected answers stored in the question answer generator. It can be provided to the service using device to be displayed on the display unit.

상기 서비스 서버는, 통화 내용을 평가할 수 있도록 인증된 컨설턴트가 상기 저장부에 저장된 통화 내용에 대해 평가를 수행한 결과를 수신하고, 컨설턴트의 평가 결과를 기록하는 평가 기록부를 포함할 수 있다.The service server may include an evaluation recording unit that receives a result of the evaluation performed by a consultant authorized to evaluate the call content and evaluates the call content stored in the storage unit and records the evaluation result of the consultant.

상기 서비스 서버는, 이용자가 자신의 통화 내용에 대한 컨설턴트의 평가를 요청하는 경우, 상기 평가 기록부에 저장된 평가 결과를 이용자에게 제공할 수 있다.The service server may provide the evaluation result stored in the evaluation recording unit to the user when the user requests the consultant's evaluation of the contents of his or her call.

상기 서비스 서버는, 상기 저장부에 저장된 통화 내용을 평가할 수 있도록 인증된 컨설턴트들의 정보를 관리 및 상기 서비스 이용장치에 제공할 수 있는 컨설턴트 관리부를 포함하고, 상기 서비스 이용장치는, 상기 컨설턴트 관리부로부터 컨선턴트들의 정보를 수신하고 이용자가 자신의 통화 내용에 대해 평가를 받을 수 있는 컨설턴트를 선택할 수 있는 인터페이스를 제공할 수 있다.The service server includes a consultant management unit capable of managing and providing information of consultants authorized to evaluate the contents of calls stored in the storage unit to the service using apparatus, and the service using apparatus is configured by the consultant management unit. It can provide an interface to receive information from the stunts and allow the user to select a consultant who can be evaluated on the contents of their calls.

상기 서비스 서버는, 서비스 비용 결제를 위한 결제부를 포함하고, 이용자가 자신의 통화 내용에 대한 컨설턴트의 평가를 요청하는 경우, 상기 결제부를 통한 결제 여부를 확인하고 결제가 이루어진 경우에 한해 상기 평가 기록부에 저장된 평가 결과를 이용자에게 제공할 수 있다.The service server includes a payment unit for payment of service costs, and when a user requests a consultant's evaluation of the contents of his/her call, the payment is confirmed in the evaluation record only when the payment is made through the payment unit Stored evaluation results can be provided to the user.

본 발명에 따른 음성인식을 이용한 통화 서비스 시스템은, 상기 네트워크를 통해 상기 서비스 서버와 통신하고, 컨설턴트가 상기 저장부에 저장된 통화 내용에 대해 평가를 수행할 수 있는 인터페이스를 제공하는 컨설팅 이용장치;를 포함할 수 있다.The call service system using voice recognition according to the present invention includes: a consulting use device that communicates with the service server through the network and provides an interface through which a consultant can perform evaluation on call contents stored in the storage unit; It can contain.

상기 서비스 이용장치는, 이용자가 상기 디스플레이부에 표시되는 텍스트를 선택할 수 있는 인터페이스를 제공하고, 이용자에 의해 선택되는 텍스트에 대한 번역 텍스트를 상기 디스플레이부에 표시할 수 있다.The service using apparatus may provide an interface through which a user can select text displayed on the display unit, and display translated text for text selected by the user on the display unit.

상기 서비스 이용장치는 입력부를 구비하고, 상기 입력부를 통해 입력되는 텍스트를 다른 언어로 번역한 번역 텍스트를 상기 디스플레이부에 표시할 수 있다.The service using apparatus may include an input unit, and display the translated text obtained by translating text input through the input unit into another language.

한편, 상술한 바와 같은 목적을 해결하기 위한 본 발명에 따른 음성인식을 이용한 통화 서비스 방법은, (a) 복수의 이용자가 각각 서비스 이용장치를 이용하여 네트워크를 통해 서비스 서버에 접속하는 단계; (b) 상기 서비스 서버가 상기 복수의 서비스 이용장치로부터 음성을 수신하는 단계; (c) 상기 서비스 서버의 STT 변환부가 수신되는 음성을 텍스트로 변환하는 단계; (d) 상기 서비스 서버가, 상기 STT 변환부에 의해 변환되는 텍스트를 상기 서비스 서버에 구비되는 저장부에 저장하고, 상기 복수의 서비스 이용장치에 각각 제공하여 상기 복수의 서비스 이용장치에 각각 구비되는 디스플레이부에 실시간으로 디스플레이하는 단계; (e) 상기 서비스 서버에 구비되는 TA 분석부가 상기 저장부에 저장되는 텍스트를 분석하는 단계; 및 (f) 상기 서비스 서버에 구비되는 분석 결과 DB가 상기 TA 분석부가 분석한 결과를 카테고리 별로 정리하여 기록하는 단계;를 포함한다.On the other hand, the call service method using a voice recognition according to the present invention for solving the above-described object, (a) a plurality of users, each using a service using the device to access the service server through the network; (b) the service server receiving a voice from the plurality of service using devices; (c) converting the voice received by the STT converter of the service server into text; (d) the service server stores text converted by the STT conversion unit in a storage unit provided in the service server, and provides each to the plurality of service use devices, and is provided in each of the plurality of service use devices Displaying in real time on the display unit; (e) analyzing a text stored in the storage unit by a TA analysis unit provided in the service server; And (f) the analysis result DB provided in the service server, recording the results analyzed by the TA analysis unit by category.

본 발명에 따른 음성인식을 이용한 통화 서비스 방법은, 상기 (f) 단계 이후, (g) 상기 서비스 서버의 질문 답변 생성부가 상기 분석 결과 DB에 저장된 데이터로부터 특정 질문과 이에 매칭되는 예상 답변을 생성하여 데이터베이스화 하는 단계;를 포함할 수 있다.In the call service method using voice recognition according to the present invention, after the step (f), (g) the question answer generation unit of the service server generates a specific question and an expected answer matching it from data stored in the analysis result DB Databaseization; may include.

본 발명에 따른 음성인식을 이용한 통화 서비스 방법은, 상기 (g) 단계 이후에, (h) 상기 서비스 서버가, 상기 복수의 서비스 이용장치로부터 음성을 수신하고, 수신되는 음성을 상기 STT 변환부를 통해 텍스트로 변환하며, 상기 STT 변환부에 의해 변환되는 텍스트를 상기 TA 분석부로 분석하는 과정을 반복하면서 상기 복수의 이용자 중 질문자와 답변자를 구분하고, 상기 STT 변환부에 의해 변환되는 텍스트 중에 상기 질문 답변 생성부에 저장된 질문에 대응하는 텍스트가 생성되는 경우, 상기 질문 답변 생성부에 저장된 예상 답변 중 해당 질문에 매칭되는 예상 답변을 상기 복수의 서비스 이용장치 중에서 답변자의 서비스 이용장치에 제공하여 해당 서비스 이용장치의 디스플레이부에 디스플레이되도록 하는 단계;를 포함할 수 있다.In the call service method using voice recognition according to the present invention, after step (g), (h) the service server receives the voices from the plurality of service use devices, and receives the voices through the STT converter. Converting to text, and repeating the process of analyzing the text converted by the STT conversion unit to the TA analysis unit, distinguishing the questioner and the answerer among the plurality of users, and answering the question among the text converted by the STT conversion unit When the text corresponding to the question stored in the generating unit is generated, the expected answer matching the corresponding question among the expected answers stored in the question answer generating unit is provided to an answerer's service using device among the plurality of service using devices to use the corresponding service It may include a; to be displayed on the display unit of the device.

본 발명에 따른 음성인식을 이용한 통화 서비스 방법은, 상기 (d) 단계 이후에, (i) 상기 서비스 서버의 평가 기록부가, 통화 내용을 평가할 수 있도록 인증된 컨설턴트로부터 상기 저장부에 저장된 통화 내용에 대해 평가 결과를 입력받고 평가 결과를 기록하는 단계; 및 (j) 이용자가 자신의 통화 내용에 대한 컨설턴트의 평가를 자신의 서비스 이용장치를 통해 상기 서비스 서버에 요청하는 경우, 상기 서비스 서버가 상기 평가 기록부의 데이터로부터 해당 이용자에 대한 평가 결과를 검색하여 해당 이용자의 서비스 이용장치에 제공하는 단계;를 포함할 수 있다.In the call service method using voice recognition according to the present invention, after the step (d), (i) the evaluation recorder of the service server, from a consultant authorized to evaluate the call content to the call content stored in the storage unit Inputting an evaluation result to the user and recording the evaluation result; And (j) when the user requests the consultant's evaluation of the content of his or her call through the service using device, the service server retrieves the evaluation result for the user from the evaluation record data. And providing the user's service using device.

본 발명에 따른 음성인식을 이용한 통화 서비스 방법은, 상기 (i) 단계 이전에, (k) 상기 서비스 서버의 컨설턴트 관리부가 통화 내용을 평가할 수 있도록 인증된 컨설턴트들의 정보를 저장하고, 저장된 컨설턴트들의 명단과 정보를 상기 서비스 이용장치에 제공하는 단계; 및 (l) 이용자가 상기 서비스 이용장치에 제공된 컨설턴트 명단 중에서 특정 컨설턴트를 선택하여 자신의 통화 내용에 대한 평가를 자신의 서비스 이용장치를 통해 상기 서비스 서버에 요청하는 경우, 상기 서비스 서버가 상기 컨설턴트 관리부에 등록되어 있는 해당 컨설턴트의 연락처로 통지하고, 해당 컨설턴트로부터 평가 요청 이용자의 통화 내용에 대한 평가를 입력받는 단계;를 포함할 수 있다.In the call service method using voice recognition according to the present invention, before step (i), (k) the consultant management unit of the service server stores information of authorized consultants to evaluate the call content, and a list of stored consultants Providing information and information to the service using device; And (l) when the user selects a specific consultant from the list of consultants provided to the service using apparatus and requests the service server to evaluate the contents of his or her call through the service using apparatus, the service server receives the consultant management unit. It may include the step of notifying to the contact information of the consultant registered in, and receiving an evaluation of the call content of the user requesting the evaluation from the consultant.

본 발명에 따른 음성인식을 이용한 통화 서비스 방법은, 상기 (d) 단계 이후에, (m) 이용자가 상기 디스플레이부에 표시되는 텍스트를 선택할 수 있는 인터페이스를 제공하고, 이용자에 의해 선택되는 텍스트에 대한 번역 텍스트를 상기 디스플레이부에 표시하는 단계;를 포함할 수 있다.The call service method using voice recognition according to the present invention provides, after step (d), (m) an interface for a user to select text displayed on the display unit, and for text selected by the user. And displaying the translated text on the display unit.

본 발명에 따른 음성인식을 이용한 통화 서비스 방법은, 상기 서비스 이용장치를 통해 텍스트를 입력받고, 해당 텍스트를 다른 언어로 번역한 번역 텍스트를 상기 디스플레이부에 표시하는 단계;를 포함할 수 있다.The call service method using voice recognition according to the present invention may include the steps of receiving text through the service using apparatus and displaying the translated text in which the text is translated into another language on the display unit.

본 발명에 따르면, 이용자들이 원격 인터뷰, 원격 회의 등 다양한 목적으로, 음성 통화 또는 영상 통화, 일 대 일 통화, 일 대 다수 통화, 다수 대 다수 통화 등 다양한 형태의 통화를 가능하게 하는 통화 서비스를 제공하면서, 이용자들 간의 대화 내용을 텍스트로 변환하여 실시간으로 이용자들에게 제공할 수 있다. 따라서, 이용자들 간의 더욱 원활하고 정확한 통화를 가능하게 한다.According to the present invention, for various purposes such as remote interviews, teleconferences, and the like, a voice call or video call, one-to-one call, one-to-many call, multiple-to-many call, etc. are provided. While, it is possible to convert conversations between users into text and provide them to users in real time. Therefore, it enables a more smooth and accurate call between users.

또한, 본 발명에 따르면, 이용자들 간의 대화 내용을 텍스트로 변환하여 저장함으로써, 인력에 의한 별도의 녹취록을 작성하지 않아도 이용자에게 녹취록을 제공할 수 있다.In addition, according to the present invention, by storing the conversation contents between users by converting it into text, it is possible to provide a recording record to a user without creating a separate recording record by a manpower.

또한, 본 발명에 따르면, 통화 내용이 변환된 텍스트를 저장하고, 저장된 텍스트를 서비스 서버가 분석하여 특정 단어나 문구에 대해 이와 관련한 다른 단어나 문구를 광범위하게 매칭하여 카테고리 별로 분류하고 데이터베이스화 할 수 있다. 이러한 데이터는 다양한 분야예서 유용하게 활용될 수 있는 빅데이터로 제공될 수 있다.In addition, according to the present invention, it is possible to classify and categorize and categorize different words or phrases related to a specific word or phrase for a specific word or phrase by analyzing the stored text and storing the converted text. have. Such data can be provided as big data that can be usefully utilized in various fields.

또한, 본 발명에 따르면, 서비스 서버가 이용자들 간의 통화가 진행되는 동안, 특정 질문에 대한 예상 답변을 필요한 이용자에게 제공할 수 있다. 따라서, 원격 인터뷰나, 외국어로 진행되는 미팅 중에 답변을 해야 하는 이용자에게 유용하게 활용될 수 있다.In addition, according to the present invention, the service server can provide an expected answer to a specific question to a user in need while a call between users is in progress. Therefore, it can be useful for users who need to answer during a remote interview or a meeting conducted in a foreign language.

또한, 본 발명에 따르면, 이용자들 간의 통화가 진행되는 동안, 서비스 이용장치에 디스플레이되는 텍스트를 다른 언어로 번역한 번역 텍스트를 제공할 수 있다. 따라서, 외국어가 익숙하지 않은 이용자가 외국어로 진행되는 원격 미팅이나 원격 인터뷰에 편리하게 이용할 수 있고, 외국인과의 원활한 원격 미팅이나 원격 인터뷰를 가능하게 해준다.In addition, according to the present invention, it is possible to provide a translated text in which text displayed on the service using device is translated into another language during a call between users. Therefore, a user who is not familiar with a foreign language can conveniently use a remote meeting or a remote interview conducted in a foreign language, and enables a smooth remote meeting or a remote interview with a foreigner.

또한, 본 발명에 따르면, 이용자가 자신의 통화 내용에 대해 전문적인 컨설턴트로부터 컨설팅을 받을 수 있다. 즉, 이용자는 자신이 과거에 수행했던 원격 인터뷰나, 원격 외국어 미팅 등에 대해 컨설턴트의 평가를 받음으로써, 자신의 부족한 부분에 대해 정보를 얻을 수 있다.In addition, according to the present invention, the user can receive consulting from a professional consultant on the content of his or her call. In other words, the user can obtain information about his/her lacking part by receiving a consultant's evaluation on a remote interview or a remote foreign language meeting he has conducted in the past.

도 1은 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템을 개략적으로 나타낸 것이다.
도 2는 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템의 서비스 이용장치를 나타낸 블록도이다.
도 3은 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템의 서비스 서버를 나타낸 블록도이다.
도 4는 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템의 컨설팅 이용장치를 나타낸 블록도이다.
도 5는 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템을 이용한 음성인식을 이용한 통화 서비스 방법을 설명하기 위한 순서도이다.
도 6 내지 도 8은 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템의 서비스 이용장치에 구비되는 디스플레이부에 표시되는 예시적인 화면들을 나타낸 것이다.
도 9는 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템을 이용한 통화 내용 컨설팅 방법을 설명하기 위한 순서도이다.
도 10은 서비스 이용장치의 변형예를 나타낸 것이다.1 schematically shows a call service system using voice recognition according to an embodiment of the present invention.
2 is a block diagram showing a service using apparatus of a call service system using voice recognition according to an embodiment of the present invention.
3 is a block diagram showing a service server of a call service system using voice recognition according to an embodiment of the present invention.
4 is a block diagram showing a consulting service apparatus for a call service system using voice recognition according to an embodiment of the present invention.
5 is a flowchart illustrating a call service method using voice recognition using a call service system using voice recognition according to an embodiment of the present invention.
6 to 8 illustrate exemplary screens displayed on a display unit provided in a service using apparatus of a call service system using voice recognition according to an embodiment of the present invention.
9 is a flow chart for explaining a call content consulting method using a call service system using voice recognition according to an embodiment of the present invention.
10 shows a modification of the service using device.

이하, 본 발명에 따른 음성인식을 이용한 통화 서비스 시스템 및 방법을 도면을 참조하여 상세히 설명한다.Hereinafter, a call service system and method using voice recognition according to the present invention will be described in detail with reference to the drawings.

도 1은 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템을 개략적으로 나타낸 것이고, 도 2는 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템의 서비스 이용장치를 나타낸 블록도이고, 도 3은 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템의 서비스 서버를 나타낸 블록도이며, 도 4는 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템의 컨설팅 이용장치를 나타낸 블록도이다.1 is a block diagram schematically showing a call service system using voice recognition according to an embodiment of the present invention, and FIG. 2 is a block diagram showing a service using apparatus of a call service system using voice recognition according to an embodiment of the present invention. 3 is a block diagram showing a service server of a call service system using voice recognition according to an embodiment of the present invention, and FIG. 4 is consulting use of a call service system using voice recognition according to an embodiment of the present invention It is a block diagram showing the device.

도면에 나타낸 것과 같이, 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템(100)은 서비스 이용자를 위한 서비스 이용장치(200)와, 서비스 이용장치(200)에 통화 서비스를 제공하기 위한 서비스 서버(300)와, 컨설턴트를 위한 컨설팅 이용장치(400)를 포함한다. 서비스 서버(300)는 네트워크(500)를 통해 서비스 이용장치(200) 및 컨설팅 이용장치(400)와 각각 통신 가능하게 연결된다. 여기에서, 네트워크(500)는 유선 네트워크 또는 무선 네트워크일 수 있다.As shown in the figure, the call service system 100 using voice recognition according to an embodiment of the present invention provides a service using apparatus 200 for a service user and a service using the service using apparatus 200 It includes a service server 300 and a consulting use device 400 for a consultant. The service server 300 is communicatively connected to the service use device 200 and the consulting use device 400 through the network 500, respectively. Here, the network 500 may be a wired network or a wireless network.

이러한 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템(100)은 이용자들에게 통화 서비스를 제공하는 것으로, 이용자들의 음성을 인식하여 이용자들의 대화 분석을 통한 빅데이터 서비스, 영어나 중국어 등 외국어의 실시간 번역 등 다양한 서비스를 제공할 수 있다. 여기에서, 음성인식은 음향학적 신호(acoustic speech signal)를 텍스트로 매핑시키는 과정으로, 마이크 등을 통하여 얻어진 음향학적 신호를 단어나 단어 집합 또는 문장으로 변환하는 과정을 말한다. 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템(100)의 통화 서비스는 음성 통화 서비스와 영상 통화 서비스를 모두 포함할 수 있다.The call service system 100 using voice recognition according to an embodiment of the present invention provides a call service to users, recognizes the voices of users, and uses big data services through conversation analysis of users, such as English or Chinese. It can provide various services such as real-time translation of foreign languages. Here, speech recognition is a process of mapping an acoustic speech signal into text, and refers to a process of converting an acoustic signal obtained through a microphone into a word, a word set, or a sentence. The call service of the call service system 100 using voice recognition according to an embodiment of the present invention may include both a voice call service and a video call service.

도 1 및 도 2에 나타낸 것과 같이, 서비스 이용장치(200)는 음성 입력을 위한 마이크(210)와, 소리를 출력하는 스피커(220)와, 화면을 출력하는 디스플레이부(215)와, 네트워크(500)를 통해 서비스 서버(300)와 통신하는 통신부(205)와, 제어부(225)를 포함한다.1 and 2, the service using apparatus 200 includes a microphone 210 for voice input, a speaker 220 for outputting sound, a display unit 215 for outputting a screen, and a network ( It includes a communication unit 205 and the control unit 225 to communicate with the service server 300 through 500.

스피커(220)는 이용자의 음성이나, 통화 상대 이용자의 음성 등 다양한 소리를 출력할 수 있다. 스피커(220)로는 유선 스피커나, 블루투스 무선 스피커 등 소리를 출력할 수 있는 다양한 것이 이용될 수 있다.The speaker 220 may output various sounds, such as a user's voice or a user's voice. As the speaker 220, a variety of things that can output sound, such as a wired speaker or a Bluetooth wireless speaker, can be used.

디스플레이부(215)는 사용자가 시각적으로 인지할 수 있는 다양한 정보를 디스플레이할 수 있다. 디스플레이부(215)는 이용자가 통화 시 서비스 서버(300)가 음성을 변환한 텍스트나, 텍스트를 번역한 번역 텍스트나, 사용자 입력을 위한 GUI 등 다양한 시각적 정보를 표시할 수 있는 다양한 것이 이용될 수 있다.The display unit 215 may display various information visually recognizable by the user. The display unit 215 may be used for a variety of visual information, such as text converted by the service server, text translated by the user, or GUI for user input when the user makes a call. have.

통신부(205)는 서비스 서버(300) 등의 외부 장치로 신호나 데이터를 전송하고, 외부 장치로부터 신호나 데이터를 수신하는 기능을 수행한다. 통신부(205)는 유선 통신 또는 무선 통신 기능을 갖는 다양한 것이 이용될 수 있다.The communication unit 205 transmits a signal or data to an external device such as the service server 300, and performs a function of receiving a signal or data from the external device. A variety of wired communication or wireless communication functions may be used as the communication unit 205.

제어부(225)는 마이크(210)와, 스피커(220)와, 디스플레이부(215)와, 통신부(205)와 전기적으로 연결되고, 서비스 이용장치(200)의 전반적인 동작을 제어한다.The control unit 225 is electrically connected to the microphone 210, the speaker 220, the display unit 215, and the communication unit 205, and controls the overall operation of the service using apparatus 200.

이 밖에, 서비스 이용장치(200)는 입력부(230)와, 저장부(235)와, 인쇄부(240)를 포함할 수 있다. 입력부(230)는 이용자가 정보를 입력하거나, 정보를 선택할 수 있는 다양한 형태로 구비될 수 있다. 이용자는 서비스 이용 시 입력부(230)를 통해 텍스트를 입력함으로써 다른 언어로 번역된 번역 텍스트를 제공받을 수 있다. 저장부(235)는 마이크(210)로 입력되는 음성 정보나, 입력부(230)를 통해 입력되는 정보, 또는 서비스 서버(300)로부터 송신되는 텍스트나, 음성 등 다양한 정보를 저장할 수 있다. 인쇄부(240)는 서비스 이용장치(200)에서 생성되는 데이터나, 서비스 서버(300)로부터 송신되는 데이터나, 저장부(235)에 저장된 데이터 등을 출력하여 이용자에게 제공할 수 있다.In addition, the service using apparatus 200 may include an input unit 230, a storage unit 235, and a printing unit 240. The input unit 230 may be provided in various forms in which a user can input information or select information. When using the service, the user may be provided with translated text translated into another language by inputting text through the input unit 230. The storage unit 235 may store various information such as voice information input through the microphone 210, information input through the input unit 230, text transmitted from the service server 300, and voice. The printing unit 240 may output data generated by the service using apparatus 200, data transmitted from the service server 300, data stored in the storage unit 235, and the like, and provide it to the user.

이러한 서비스 이용장치(200)는 서비스 서버(300)와 통신 가능하게 연결되어 이용자가 다른 이용자와 통화를 가능하게 하고, 디스플레이부(215)를 통해 서비스 서버(300)에서 전송되는 텍스트를 디스플레이하며, 이용자에 의한 정보 입력이나 선택을 가능하게 하는 인터페이스를 제공한다. 이용자는 서비스 이용장치(200)가 제공하는 인터페이스를 통해 디스플레이부(215)에 디스플레이되는 텍스트를 선택함으로써 텍스트에 대한 번역 서비스를 제공받을 수 있다.The service using apparatus 200 is communicatively connected to the service server 300 to enable a user to communicate with other users, and to display text transmitted from the service server 300 through the display unit 215, It provides an interface that enables information input or selection by users. The user may be provided with a translation service for text by selecting the text displayed on the display unit 215 through an interface provided by the service using apparatus 200.

서비스 이용장치(200)로는 개인 휴대 단말기, PC, 노트북 컴퓨터, 태블릿 PC, 넷북 컴퓨터, 디지털 웨어러블 장치, 디지털 TV 등 통신 기능과, 음성 출력 기능과, 디스플레이 기능과, 사용자 입력 인터페이스를 제공할 수 있는 다양한 종류의 것이 이용될 수 있다. 서비스 이용장치(200)에는 서비스 이용을 위한 앱이 설치될 수 있다.The service using device 200 can provide a communication function, a voice output function, a display function, and a user input interface, such as a personal portable terminal, a PC, a notebook computer, a tablet PC, a netbook computer, a digital wearable device, and a digital TV. Various types can be used. An app for using a service may be installed in the service using device 200.

도 1 및 도 3에 나타낸 것과 같이, 서비스 서버(300)는 네트워크(500)를 통해 서비스 이용장치(200) 및 컨설팅 이용장치(400)와 각각 통신 가능하게 연결되어 통신 서비스를 제공한다. 서비스 서버(300)는 통신부(305)와, STT 변환부(310)와, 저장부(315)와, TA 분석부(320)와, 분석 결과 DB(325)와, 질문 답변 생성부(330)와, 번역부(335)와, 평가 기록부(340)와, 이용자 관리부(345)와, 컨설턴트 관리부(350)와, 결제부(355)와, 제어부(360)를 포함한다. 이러한 서비스 서버(300)는 복수의 서비스 이용장치(200)와 네트워크(500)를 통해 연결되어 복수의 이용자에게 통화 서비스를 제공할 수 있다. 즉, 서비스 서버(300)는 복수의 서비스 이용장치(200) 간의 통신을 중계하고, 각각의 서비스 이용장치(200)로부터 음성 또는 영상 신호를 수신하여 음성을 텍스트로 변환하여 이용자들에게 다양한 서비스를 제공할 수 있다.1 and 3, the service server 300 is communicatively connected to the service usage apparatus 200 and the consulting usage apparatus 400 through a network 500 to provide communication services. The service server 300 includes a communication unit 305, an STT conversion unit 310, a storage unit 315, a TA analysis unit 320, an analysis result DB 325, and a question answer generation unit 330 And a translation unit 335, an evaluation recording unit 340, a user management unit 345, a consultant management unit 350, a payment unit 355, and a control unit 360. The service server 300 may be connected to a plurality of service use devices 200 and a network 500 to provide a call service to a plurality of users. That is, the service server 300 relays communication between the plurality of service use devices 200 and receives voice or video signals from each service use device 200 to convert voice to text to provide various services to users. Can provide.

통신부(305)는 네트워크(500)에 접속하여 통신 기능을 수행하며, 서비스 이용장치(200)나 컨설팅 이용장치(400)에 신호나 데이터를 전송하고, 서비스 이용장치(200)나 컨설팅 이용장치(400)로부터 신호나 데이터를 수신할 수 있다. 통신부(305)는 유선 통신 또는 무선 통신 기능을 갖는 다양한 것이 이용될 수 있다.The communication unit 305 connects to the network 500 to perform a communication function, transmits a signal or data to the service using device 200 or the consulting using device 400, and uses the service using device 200 or the consulting using device ( 400) can receive a signal or data. As the communication unit 305, a variety of wired communication or wireless communication functions can be used.

STT 변환부(310)는 서비스 이용장치(200)로부터 수신하는 음성을 텍스트로 변환하는 역할을 한다. 이용자들이 서비스 이용장치(200)를 통해 통화를 하는 동안 STT 변환부(310)가 통화 중의 음성을 실시간으로 텍스트로 변환한다. STT 변환부(310)에 변환되는 텍스트는 통화 대상 이용자들 각각의 서비스 이용장치(200)로 전송되어 각 서비스 이용장치(200)의 디스플레이부(215)에 디스플레이되고, 저장부(315)에 저장될 수 있다.The STT conversion unit 310 serves to convert the voice received from the service using device 200 into text. The STT converter 310 converts the voice in a call to text in real time while users are making a call through the service using device 200. The text converted to the STT conversion unit 310 is transmitted to the service use devices 200 of the respective call target users, displayed on the display unit 215 of each service use device 200, and stored in the storage unit 315 Can be.

저장부(315)는 STT 변환부(310)에 의해 변환되는 텍스트와, 서비스 이용장치(200)로부터 수신되는 음성과, 기타 정보나 데이터를 저장할 수 있다. 저장부(315)가 이용자들 간의 대화를 저장함으로써, 서비스 서버(300)는 녹취록 기능을 제공할 수 있다. 즉, 이용자가 자신의 통화에 대한 내용을 서비스 이용장치(200)에 요청하는 경우, 서비스 이용장치(200)는 저장부(315)에 저장된 통화 내용을 검색하여 해당 이용자가 요청한 통화 내용을 이용자의 서비스 이용장치(200)에 제공할 수 있다.The storage unit 315 may store text converted by the STT conversion unit 310, voice received from the service using apparatus 200, and other information or data. As the storage unit 315 stores conversations between users, the service server 300 may provide a recording function. That is, when the user requests the content of his or her call to the service using device 200, the service using device 200 searches for the content of the call stored in the storage unit 315 and retrieves the content of the call requested by the user. It can be provided to the service using device 200.

TA 분석부(320)는 저장부(315)에 저장된 텍스트를 분석하는 역할을 한다. TA 분석부(320)는 STT 변환부(310)에 의해 변환된 텍스트를 이용하여 이용자들 간의 통화 내용을 분석할 수 있다. 즉, TA 분석부(320)는 STT 변환부(310)에 의해 변환되는 텍스트를 단어 분석, 또는 문구 분석 등의 방법으로 카테고리 별로 정리할 수 있다. 예를 들어, TA 분석부(320)는 텍스트 중에 특정 질문과 이에 대한 답변이 존재하면 관련 내용들을 특정 카테고리로 묶어 정리할 수 있다.The TA analysis unit 320 serves to analyze text stored in the storage unit 315. The TA analysis unit 320 may analyze the content of calls between users by using the text converted by the STT conversion unit 310. That is, the TA analysis unit 320 may organize text converted by the STT conversion unit 310 into categories by a method such as word analysis or phrase analysis. For example, the TA analysis unit 320 may organize and organize related contents into a specific category when a specific question and an answer thereto exist in the text.

분석 결과 DB(325)는 TA 분석부(320)가 분석한 결과를 기록한다. 즉, 분석 결과 DB(325)는 TA 분석부(320)가 분석한 결과를 카테고리 별로 정리하여 기록 및 데이터베이스화 할 수 있다.The analysis result DB 325 records the results analyzed by the TA analysis unit 320. That is, the analysis result DB 325 may organize and record and analyze the results analyzed by the TA analysis unit 320 by category.

다양한 이용자들이 서비스 서버(300)를 통해 통신 서비스를 제공받음에 따라 TA 분석부(320)가 이용자들 간의 다양한 대화 내용을 분석하고, 분석 결과 DB(325)가 TA 분석부(320)가 분석한 내용을 지속적으로 업그레이드할 수 있다. 따라서, 특정 단어나 문구에 대해 이와 관련한 다양한 단어나 문구를 매칭하여 광범위한 데이터베이스 구축이 가능하다.As various users receive communication services through the service server 300, the TA analysis unit 320 analyzes various conversations between users, and the analysis result DB 325 analyzes the TA analysis unit 320 Content can be continuously upgraded. Therefore, it is possible to construct a wide range of databases by matching various words or phrases related to a specific word or phrase.

질문 답변 생성부(330)는 분석 결과 DB(325)에 저장된 데이터로부터 특정 질문과 이에 매칭되는 예상 답변을 생성하여 데이터베이스화 할 수 있다. 즉, 질문 답변 생성부(330)는 분석 결과 DB(325)에 정리되어 있는 특정 질문과 이에 관련한 답변으로부터 특정 질문에 대한 예상 답변을 생성할 수 있다.The question answer generation unit 330 may generate and database a specific question and an expected answer matching the result from data stored in the analysis result DB 325. That is, the question answer generation unit 330 may generate an expected answer for a specific question from a specific question and answers related to the analysis result DB 325.

질문 답변 생성부(330)가 데이터베이스화 한 정보는 이용자들 간의 통화 중에 제공될 수 있다. 즉, 이용자들 간의 통화가 진행되는 동안 STT 변환부(310)에 의해 변환되는 텍스트 중에 질문 답변 생성부(330)에 저장된 질문에 대응하는 텍스트가 생성되는 경우, 서비스 서버(300)는 질문 답변 생성부(330)에 저장된 예상 답변 중 해당 질문에 매칭되는 예상 답변을 서비스 이용장치(200)에 제공할 수 있다. 이러한 예상 답변 기능은 원격 인터뷰나, 외국어로 진행되는 미팅 중에 답변을 해야 하는 이용자에게 유용하게 이용될 수 있다.The information generated by the question and answer generator 330 may be provided during a call between users. That is, when text corresponding to a question stored in the question answer generation unit 330 is generated among texts converted by the STT conversion unit 310 during a call between users, the service server 300 generates a question answer Among the expected answers stored in the unit 330, an expected answer matching the corresponding question may be provided to the service using apparatus 200. This expected answer function can be useful for users who need to answer during a remote interview or a meeting conducted in a foreign language.

번역부(335)는 STT 변환부(310)에 의해 변환되는 텍스트 또는 서비스 이용장치(200)의 입력부(230)를 통해 입력되는 텍스트에 대한 번역 기능을 수행한다. 이용자가 통신 서비스 이용 중에 텍스트의 번역을 요청하는 경우, 번역부(335)가 이용자가 번역 요청한 텍스트를 영어, 중국어 등 다양한 외국어로 번역하거나, 영어나 중국어 등 외국어 텍스트를 번역한 이용자의 자국어로 번역한 텍스트를 해당 서비스 이용장치(200)의 디스플레이부(215)에 표시할 수 있다. 여기에서, 번역 텍스트는 단어, 문장 등 이용자가 필요로 하는 다양한 형태로 제공될 수 있다.The translation unit 335 translates text converted by the STT conversion unit 310 or text input through the input unit 230 of the service using apparatus 200. When the user requests translation of the text while using the communication service, the translation unit 335 translates the text requested by the user into various foreign languages such as English and Chinese, or translates the user's native language into English or Chinese. One text may be displayed on the display unit 215 of the corresponding service using device 200. Here, the translated text may be provided in various forms required by the user, such as words and sentences.

이러한 번역 서비스를 위해 서비스 이용장치(200)는 디스플레이부(215)에 표시되는 텍스트를 선택할 수 있는 인터페이스를 제공할 수 있다. 이용자가 통화 중에 서비스 이용장치(200)의 인터페이스를 통해 특정 텍스트를 선택하면 번역부(335)가 이용자에 의해 선택된 텍스트에 대한 번역 텍스트를 서비스 이용장치(200)의 디스플레이부(215)에 표시할 수 있다. 디스플레이부(215)에 표시되는 텍스트를 선택하기 위한 인터페이스는 텍스트 터치를 통한 선택, 마우스 클릭에 의한 선택 등 다양한 방식으로 구현될 수 있다.For such a translation service, the service using apparatus 200 may provide an interface for selecting text displayed on the display unit 215. When the user selects a specific text through the interface of the service using apparatus 200 during a call, the translation unit 335 displays the translated text for the text selected by the user on the display unit 215 of the service using apparatus 200. Can. The interface for selecting the text displayed on the display unit 215 may be implemented in various ways such as selection through text touch and selection by mouse click.

평가 기록부(340)는 컨설턴트가 저장부(315)에 저장된 통화 내용에 대해 평가를 수행한 결과를 수신하고, 컨설턴트의 평가 결과를 기록하는 기능을 갖는 것으로, 서비스 서버(300)는 평가 기록부(340)를 통해 이용자에게 통화 내용에 대한 평가 서비스를 제공할 수 있다. 즉, 서비스 서버(300)는 컨설턴트로부터 이용자의 통화 내용에 대한 평가를 입력 받고, 이용자가 자신의 통화 내용에 대한 컨설턴트의 평가를 요청하는 경우, 평가 기록부(340)에 저장된 평가 결과를 이용자에게 제공할 수 있다.The evaluation recording unit 340 has a function of receiving a result of a consultant performing evaluation on the contents of a call stored in the storage unit 315 and recording the evaluation result of the consultant, and the service server 300 has an evaluation recording unit 340 ), it is possible to provide an evaluation service for the contents of the call to the user. That is, the service server 300 receives the evaluation of the call content of the user from the consultant, and when the user requests the evaluation of the consultant about the content of the call, the service server 300 provides the evaluation result stored in the evaluation recording unit 340 to the user can do.

컨설턴트는 저장부(315)에 저장된 통화 내용에 대해 평가를 할 수 있도록 사전에 인증되고 등록된 전문가일 수 있다. 다양한 카테고리에 대한 전문적인 컨설팅을 위해 본 발명에 따른 음성인식을 이용한 통화 서비스 시스템(100)은 다양한 분야에 대한 전문 지식을 가진 다양한 전문가를 컨설턴트로 등록받아 서비스를 제공할 수 있다.The consultant may be an expert who has been previously authenticated and registered so that the contents of the call stored in the storage unit 315 can be evaluated. For professional consulting for various categories, the call service system 100 using voice recognition according to the present invention can provide services by registering various experts with expertise in various fields as consultants.

예를 들어, 이용자들 간의 원격 인터뷰가 이루어지는 경우, 컨설턴트가 인터뷰 내용에 대한 평가를 수행하여 평가 기록부(340)에 저장하고, 이용자는 컨설턴트가 수행한 인터뷰 평가를 요청하여 제공받을 수 있다. 또한, 외국어 미팅이 이루어지는 경우, 컨설턴트가 이용자의 외국어 대화에 대해 평가를 수행하여 평가 기록부(340)에 저장하고, 이용자는 컨설턴트가 수행한 외국어 미팅에 대한 평가를 요청하여 제공받을 수 있다. 따라서, 이용자는 자신의 실력이나 태도, 보완해야 할 부분에 대해 편리하게 전문가의 컨설팅을 받을 수 있다.For example, when a remote interview is made between users, the consultant performs evaluation on the interview contents and stores it in the evaluation recorder 340, and the user can request and receive an interview evaluation performed by the consultant. In addition, when a foreign language meeting is conducted, the consultant performs evaluation on the user's foreign language conversation and stores it in the evaluation recorder 340, and the user can request and receive an evaluation for the foreign language meeting conducted by the consultant. Therefore, the user can conveniently receive expert consulting on their skills, attitudes, and areas to be supplemented.

이용자 관리부(345)는 서비스 이용자들의 정보를 저장 및 관리한다. 이용자가 서비스 이용을 위해 등록한 개인 정보들이 이용자 관리부(345)에서 관리될 수 있다.The user management unit 345 stores and manages information of service users. The personal information registered by the user for use of the service may be managed by the user management unit 345.

컨설턴트 관리부(350)는 인증된 컨설턴트들의 정보를 관리한다. 또한, 컨설턴트 관리부(350)는 사전 설정 절차에 따라 인증된 컨설턴트들의 정보를 서비스 이용장치(200)에 제공할 수 있다. 컨설턴트 관리부(350)가 컨설턴트들의 정보를 서비스 이용장치(200)에 제공하면, 이용자가 컨설턴트 관리부(350)에서 제공되는 컨설턴트들의 정보를 확인하고 자신이 컨설팅받기 원하는 특정 컨설턴트를 지정하여 컨설팅을 받을 수 있다.The consultant management unit 350 manages information of certified consultants. In addition, the consultant management unit 350 may provide information of certified consultants to the service using apparatus 200 according to a preset procedure. When the consultant management unit 350 provides the consultant's information to the service use device 200, the user can check the information of the consultants provided by the consultant management unit 350 and designate a specific consultant to be consulted to receive consulting. have.

결제부(355)는 이용자의 서비스 비용 결제를 위한 것으로, 이용자가 결제부(355)를 통해 서비스 이용을 위한 기본 비용 또는 부가 서비스 비용을 결제할 수 있다.The payment unit 355 is for payment of the service cost of the user, and the user can pay the basic cost or additional service cost for using the service through the payment unit 355.

통화 내용에 대한 컨설턴트의 컨설팅 서비스는 부가 서비스 비용을 지불한 경우에 한해 이용자에게 제공될 수 있다. 이 경우, 서비스 서버(300)는 이용자가 자신의 통화 내용에 대한 컨설턴트의 평가를 요청하는 경우, 결제부(355)를 통한 결제 여부를 확인하고, 결제가 이루어진 경우에 한해 평가 기록부(340)에 저장된 평가 결과를 이용자에게 제공할 수 있다.Consulting services of consultants on the contents of calls can be provided to users only if they have paid for additional services. In this case, the service server 300 checks whether or not payment is made through the payment unit 355 when the user requests the consultant's evaluation of the contents of his/her call, and only when the payment is made, the evaluation server 340 Stored evaluation results can be provided to the user.

제어부(360)는 통신부(305)와, STT 변환부(310)와, 저장부(315)와, TA 분석부(320)와, 분석 결과 DB(325)와, 질문 답변 생성부(330)와, 번역부(335)와, 평가 기록부(340)와, 이용자 관리부(345)와, 컨설턴트 관리부(350)와, 결제부(355)와 전기적으로 연결되고, 서비스 서버(300)의 전반적인 동작을 제어한다.The control unit 360 includes a communication unit 305, an STT conversion unit 310, a storage unit 315, a TA analysis unit 320, an analysis result DB 325, and a question and answer generation unit 330. , The translation unit 335, the evaluation recording unit 340, the user management unit 345, the consultant management unit 350, and the payment unit 355 is electrically connected, and controls the overall operation of the service server 300 do.

도 1 및 도 4에 나타낸 것과 같이, 컨설팅 이용장치(400)는 컨설턴트가 서비스 서버(300)의 저장부(315)에 저장된 통화 내용에 대해 평가를 수행할 수 있는 인터페이스를 제공하는 것으로, 통신부(405)와, 디스플레이부(410)와, 입력부(415)와, 저장부(420)와, 제어부(425)를 포함한다.As shown in FIGS. 1 and 4, the consulting usage device 400 provides an interface through which a consultant can perform evaluation on the contents of a call stored in the storage unit 315 of the service server 300. 405), a display unit 410, an input unit 415, a storage unit 420, and a control unit 425.

통신부(405)는 서비스 서버(300) 등의 외부 장치로 신호나 데이터를 전송하고, 외부 장치로부터 신호나 데이터를 수신하는 기능을 수행한다. 통신부(405)는 유선 통신 또는 무선 통신 기능을 갖는 다양한 것이 이용될 수 있다.The communication unit 405 transmits a signal or data to an external device such as the service server 300, and performs a function of receiving a signal or data from the external device. A variety of wired communication or wireless communication functions may be used as the communication unit 405.

디스플레이부(410)는 컨설턴트가 시각적으로 인지할 수 있는 다양한 정보를 디스플레이할 수 있다. 디스플레이부(410)는 서비스 서버(300)의 저장부(315)에 저장되는 텍스트 자료 등 서비스 서버(300)가 제공하는 각종 데이터나, 입력을 위한 GUI 등 다양한 시각적 정보를 표시할 수 있는 다양한 것이 이용될 수 있다.The display unit 410 may display various information visually recognizable by the consultant. The display unit 410 includes various data provided by the service server 300, such as text data stored in the storage unit 315 of the service server 300, and various visual information such as a GUI for input. Can be used.

입력부(415)는 컨설턴트가 정보를 입력하거나, 정보를 선택할 수 있는 다양한 형태로 구비될 수 있다The input unit 415 may be provided in various forms in which a consultant can input information or select information.

저장부(420)는 서비스 서버(300)로부터 제공되는 데이터나, 컨설턴트가 수행한 평가 결과 등 컨설팅 이용장치(400)로 전송되는 각종 데이터나, 입력부(415)에 의해 입력되는 데이터 등 다양한 정보를 저장할 수 있다.The storage unit 420 may receive various information such as data provided from the service server 300, various data transmitted to the consulting using device 400, such as evaluation results performed by a consultant, or data input by the input unit 415. Can be saved.

제어부(425)는 통신부(405)와, 디스플레이부(410)와, 입력부(415)와, 저장부(420)와 전기적으로 연결되고, 컨설팅 이용장치(400)의 전반적인 동작을 제어한다.The control unit 425 is electrically connected to the communication unit 405, the display unit 410, the input unit 415, and the storage unit 420, and controls the overall operation of the consulting usage device 400.

이하에서는 상술한 것과 같은 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템을 이용한 음성인식을 이용한 통화 서비스 방법에 대하여 설명한다.Hereinafter, a call service method using voice recognition using a call service system using voice recognition according to an embodiment of the present invention as described above will be described.

도 5는 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 시스템을 이용한 음성인식을 이용한 통화 서비스 방법을 설명하기 위한 순서도이다.5 is a flowchart illustrating a call service method using voice recognition using a call service system using voice recognition according to an embodiment of the present invention.

도면에 나타낸 것과 같이, 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 방법은, 서비스 등록 단계(S11)와, 사용자 인증 단계(S12)와, 전화 걸기 단계(S13)와, 실시간 STT 변환 및 저장 단계(S14)를 포함한다.As shown in the figure, the call service method using voice recognition according to an embodiment of the present invention includes a service registration step (S11), a user authentication step (S12), a dialing step (S13), and real-time STT conversion. And a storage step (S14).

서비스 등록 단계(S11)는 서비스 이용을 원하는 이용자가 서비스 서버(300)에 자신의 이용자 정보를 제공하고 서비스 이용을 신청하는 단계이다. 이 단계에서, 이용자가 서비스 신청 웹페이지에 접속하여 회원 가입을 하거나, 앱을 설치하고 서비스 등록 절차를 수행할 수 있다. 서비스 등록 단계(S11)에서 이용자가 제공한 정보는 서비스 서버(300)의 이용자 관리부(345)에 기록되어 관리될 수 있다.The service registration step S11 is a step in which a user who wants to use the service provides his/her user information to the service server 300 and applies for service use. At this stage, the user can access the service application webpage to sign up for membership, install the app, and perform the service registration process. The information provided by the user in the service registration step S11 may be recorded and managed in the user management unit 345 of the service server 300.

사용자 인증 단계(S12)는 서비스 가입 이용자가 자신의 서비스 이용장치(200)를 이용하여 네트워크(500)를 통해 서비스 서버(300)에 접속할 때, 서비스 서버(300)가 이용자 관리부(345)에 기록된 이용자 정보를 조회하고 이용자를 확인하는 단계이다.In the user authentication step (S12), when the service subscription user accesses the service server 300 through the network 500 using his/her service using apparatus 200, the service server 300 records in the user management unit 345 It is a step of inquiring the user information and confirming the user.

사용자 인증 단계(S12)에서 인증된 이용자는 전화 걸기 단계(S13)에서 전화 걸기를 수행하여 통화를 원하는 다른 이용자와 통화를 할 수 있다. 이때, 통화는 원격 인터뷰, 원격 회의 등 다양한 목적으로, 음성 통화 또는 영상 통화, 일 대 일 통화, 일 대 다수 통화, 다수 대 다수 통화 등 다양한 형태로 이루어질 수 있다.The user authenticated in the user authentication step (S12) can make a call with another user who wants to make a call by performing the phone call in the dialing step (S13). In this case, the call may be made in various forms such as a voice call or a video call, a one-to-one call, a one-to-many call, a multi-to-many call for various purposes such as a remote interview and a remote meeting.

이용자들 간의 통화가 이루어지는 동안 실시간 STT 변환 및 저장 단계(S14)가 실행된다. 서비스 서버(300)는 서비스 서버(300)에 접속하여 통화 서비스를 제공받는 복수의 서비스 이용장치(200)로부터 음성을 수신하고, STT 변환부(310)를 통해 수신되는 음성을 텍스트로 변환한다. 이때, STT 변환부(310)에 의해 변환되는 텍스트는 서비스 서버(300)의 저장부(315)에 저장되고, 복수의 서비스 이용장치(200)에 각각 제공되어 각 서비스 이용장치(200)에서 디스플레이될 수 있다.During a call between users, a real-time STT conversion and storage step (S14) is executed. The service server 300 connects to the service server 300 to receive voices from a plurality of service use devices 200 receiving a call service, and converts voices received through the STT converter 310 into text. At this time, the text converted by the STT conversion unit 310 is stored in the storage unit 315 of the service server 300 and provided to each of a plurality of service use devices 200 to be displayed on each service use device 200 Can be.

각 서비스 이용장치(200)에 제공되는 텍스트는 도 6와 같은 형태로 각 서비스 이용장치(200)에 구비되는 디스플레이부(215)의 텍스트 표시 영역(216)에 실시간으로 디스플레이될 수 있으며, 서비스 이용자는 통화 내용을 텍스트로 확인할 수 있다. 이용자의 디스플레이부(215)에는 변환된 텍스트 이외에 상대 이용자나, 자신의 모습, 또는 다양한 다른 정보가 디스플레이될 수 있다.The text provided to each service using device 200 may be displayed in real time on the text display area 216 of the display unit 215 provided in each service using device 200 in the form shown in FIG. You can check the contents of the call in text. In addition to the converted text, the user's display unit 215 may display a counterpart user, his or her appearance, or various other information.

이용자들 간에 통화가 이루어는 동안 서비스 서버(300)의 저장부(315)에는 STT 변환부(310)에 의해 변환되는 텍스트 이외에 이용자들의 음성도 함께 저장될 수 있다. 그리고 서비스 서버(300)의 저장부(315)에 저장되는 텍스트와 음성은 이용자용 서비스 이용장치(200)의 저장부(235)에도 동일하게 저장될 수 있다.In addition to the text converted by the STT converter 310 in the storage unit 315 of the service server 300 while a call is being made between users, voices of users may also be stored. In addition, text and voice stored in the storage unit 315 of the service server 300 may be stored in the storage unit 235 of the user service use apparatus 200 in the same way.

STT 변환 및 저장 단계(S14) 이후, TA 분석 및 저장 단계(S16)와, 예상 답변 제공 단계(S17)가 수행될 수 있다.After the STT conversion and storage step (S14), a TA analysis and storage step (S16) and an expected answer providing step (S17) may be performed.

TA 분석 및 저장 단계(S16)에서 STT 변환부(310)에 의해 변환되어 저장부(315)에 저장되는 텍스트가 TA 분석부(320)에 의해 분석되고, TA 분석부(320)의 분석 결과가 분석 결과 DB(325)에 기록된다. 이 단계에서, TA 분석부(320)가 STT 변환부(310)에 의해 변환되는 텍스트를 단어 분석, 또는 문구 분석 등의 방법으로 카테고리 별로 정리하고, 분석 결과 DB(325)가 TA 분석부(320)의 분석 결과를 카테고리 별로 정리하여 기록 및 데이터베이스화 한다. 이때, 특정 단어나 문구에 대해 이와 관련한 다른 단어나 문구가 광범위하게 매칭되어 데이터베이스화 될 수 있고, 질문 답변 생성부(330)가 분석 결과 DB(325)에 저장된 데이터로부터 특정 질문과 이에 매칭되는 예상 답변을 생성하여 데이터베이스화 할 수 있다.In the TA analysis and storage step (S16), the text converted by the STT conversion unit 310 and stored in the storage unit 315 is analyzed by the TA analysis unit 320, and the analysis result of the TA analysis unit 320 is displayed. The analysis results are recorded in DB 325. In this step, the TA analysis unit 320 organizes the text converted by the STT conversion unit 310 into categories by word analysis or phrase analysis, and the analysis result DB 325 analyzes the TA analysis unit 320 ) The results of analysis are organized by category to record and database. At this time, for a specific word or phrase, other words or phrases related to this may be extensively matched and databased, and the question answer generator 330 predicts matching with a specific question from data stored in the analysis result DB 325 You can create an answer and database it.

예상 답변 제공 단계(S17)는 이용자들 간의 통화가 이루어지는 동안 서비스 서버(300)가 특정 질문에 대한 예상 답변을 이용자에게 제공하는 단계이다. 서비스 이용장치(200)는 통화가 이루어지는 동안 복수의 서비스 이용장치(200)로부터 음성을 수신하고, 수신되는 음성을 상기 STT 변환부(310)를 통해 텍스트로 변환하고, STT 변환부(310)에 의해 변환되는 텍스트를 TA 분석부(320)로 분석하는 과정을 수행하면서 복수의 이용자 중 질문자와 답변자를 구분할 수 있다. 그리고 STT 변환부(310)에 의해 변환되는 텍스트 중에 질문 답변 생성부(330)에 저장되어 있는 질문에 대응하는 텍스트가 생성되는 경우, 질문 답변 생성부(330)에 저장된 예상 답변 중 해당 질문에 매칭되는 예상 답변을 복수의 서비스 이용장치(200) 중에서 질문에 대해 답변해야 하는 답변자의 서비스 이용장치(200)에 제공할 수 있다.The expected answer providing step (S17) is a step in which the service server 300 provides an expected answer to a specific question to the user while a call is made between users. The service using device 200 receives a voice from a plurality of service using devices 200 during a call, converts the received voice into text through the STT converter 310, and sends it to the STT converter 310. While performing the process of analyzing the text converted by the TA analysis unit 320, it is possible to distinguish between the questioner and the answerer among a plurality of users. In addition, if text corresponding to the question stored in the question answer generator 330 is generated among the text converted by the STT converter 310, the corresponding question is matched among the expected answers stored in the question answer generator 330 The expected answer can be provided to the service use device 200 of the answerer who needs to answer the question among the plurality of service use devices 200.

답변자의 서비스 이용장치(200)에 제공되는 예상 답변은 도 7에 나타낸 것과 같이, 서비스 이용장치(200)에 구비되는 디스플레이부(215)의 예상 답변 표시 영역(217)에 디스플레이될 수 있다. 따라서, 이용자는 통화 서비스 이용 중에 상대방의 질문에 대해 서비스 이용장치(200)가 추천하는 예상 답변을 참고하여 상대방의 질문에 대응할 수 있다.The predicted answer provided to the respondent's service using apparatus 200 may be displayed on the expected answer display area 217 of the display unit 215 provided in the service using apparatus 200, as shown in FIG. 7. Accordingly, the user may respond to the other party's question by referring to the expected answer recommended by the service using apparatus 200 to the other party's question while using the call service.

이용자들 간의 통화가 원격 인터뷰인 경우, 예상 답변으로는 면접 관련 질문에 대한 답변일 수 있다. 그리고 이용자들 간의 통화가 외국어 미팅인 경우, 예상 답변으로는 외국어 질문에 대한 외국어 답변일 수 있다.If the call between users is a remote interview, the expected answer may be an answer to an interview question. In addition, when the call between users is a foreign language meeting, the expected answer may be a foreign language answer to a foreign language question.

한편, 이용자는 통화 서비스 중에 자신의 디스플레이부(215)에 표시되는 텍스트에 대한 번역 서비스를 받을 수 있다. 이용자의 서비스 이용장치(200)에 제공되는 텍스트가 외국어인 경우, 이용자는 서비스 이용장치(200)가 제공하는 입력 인터페이스를 통해 번역이 필요한 텍스트를 선택할 수 있다. 이때, 번역부(335)가 STT 변환부(310)에 의해 변환되는 텍스트에 대한 번역 기능을 수행함으로써, 서비스 이용장치(200)의 디스플레이부(215)에 번역 텍스트가 표시될 수 있다. 또한, 이용자의 서비스 이용장치(200)에 제공되는 텍스트가 이용자의 자국어인 경우, 이용자는 다양한 외국어 번역 텍스트를 제공받을 수 있다. 이를 위해 이용자는 서비스 이용장치(200)가 제공하는 입력 인터페이스를 통해 번역이 필요한 텍스트와 번역을 원하는 외국어를 선택함으로써, 서비스 서버(300)로부터 외국어 번역 텍스트를 제공받을 수 있다.Meanwhile, a user may receive a translation service for text displayed on his/her display unit 215 during a call service. When the text provided to the user's service using apparatus 200 is a foreign language, the user may select text that needs to be translated through an input interface provided by the service using apparatus 200. At this time, the translation unit 335 performs the translation function for the text converted by the STT conversion unit 310, so that the translation text can be displayed on the display unit 215 of the service using apparatus 200. In addition, when the text provided to the user's service using apparatus 200 is the user's native language, the user can be provided with various foreign language translated texts. To this end, the user can receive text translated from the service server 300 by selecting text that needs to be translated and a foreign language desired to be translated through the input interface provided by the service using apparatus 200.

번역 텍스트는 도 8에 나타낸 것과 같이 서비스 이용장치(200)에 구비되는 디스플레이부(215)의 번역문 표시 영역(218)에 디스플레이될 수 있다. 따라서, 이용자는 통화 서비스 이용 중에 외국어 텍스트를 어렵지 않게 해석할 수 있다. 번역문 표시 영역(218)은 도시된 것과 같은 말풍선 도형 형태 등 텍스트 표시 영역(216)과 구분될 수 있는 다양한 형태로 이루어질 수 있다.The translated text may be displayed on the translated text display area 218 of the display unit 215 provided in the service using apparatus 200 as shown in FIG. 8. Therefore, the user can interpret the foreign language text without difficulty during the use of the call service. The translation display area 218 may be formed in various forms that can be distinguished from the text display area 216, such as a speech bubble shape as illustrated.

이러한 번역 서비스는 서비스 서버(300)가 아닌 서비스 이용장치(200) 자체의 기능을 이용하여 제공될 수 있다. 이 경우, 서비스 이용장치(200)에 설치되는 사전 프로그램이나 앱, 또는 번역 프로그램이나 앱을 작동시켜 텍스트에 대한 번역 서비스를 제공할 수 있다.The translation service may be provided using a function of the service using device 200 itself, not the service server 300. In this case, a translation service for text may be provided by operating a dictionary program or an app installed in the service using device 200 or a translation program or app.

또한, 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 방법은 이용자에게 통화 내용에 대한 컨설팅 서비스를 제공하기 위한 단계를 포함할 수 있다.In addition, the call service method using voice recognition according to an embodiment of the present invention may include a step for providing a consulting service for the content of the call to the user.

즉, 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 방법은 이용자에게 통화 내용에 대한 컨설팅 서비스를 제공하기 위해 도 9에 나타낸 것과 같이, 대화 목록 선택 단계(S21)와, 컨설턴트 선택 단계(S22)와, 결제 단계(S23)와, 평가 단계(S24)와, 평가 결과 제공 단계(S25)를 포함할 수 있다.That is, in the call service method using voice recognition according to an embodiment of the present invention, as shown in FIG. 9, in order to provide a consulting service for the contents of a call to a user, a conversation list selection step (S21) and a consultant selection step ( S22), a payment step (S23), an evaluation step (S24), and an evaluation result providing step (S25).

대화 목록 선택 단계(S21)는 이용자가 자신이 과거에 통화했던 대화 목록을 자신의 서비스 이용장치(200)를 통해 확인하고, 컨설턴트로부터 평가를 받기를 원하는 대화 내용을 선택하는 단계이다. 이를 위해 이용자가 자신의 서비스 이용장치(200)를 통해 서비스 서버(300)에 접속하여 자신의 과거 대화 목록 및 내용을 검색할 수 있도록 하거나, 서비스 서버(300)에 저장되어 있는 이용자의 대화 목록 및 내용이 서비스 이용장치(200)에 제공될 수 있다.The conversation list selection step (S21) is a step in which the user checks the conversation list that he or she has called in the past through his service use apparatus 200, and selects the conversation content that he or she wants to be evaluated by a consultant. To this end, the user can access the service server 300 through his/her service using device 200 and search his/her past conversation list and contents, or the user's conversation list stored in the service server 300 and Content may be provided to the service using device 200.

컨설턴트 선택 단계(S22)는 이용자가 자신이 평가를 받기를 원하는 컨설턴트를 선택하는 단계이다. 이를 위해 이용자가 자신의 서비스 이용장치(200)를 통해 서비스 서버(300)에 접속하여 서비스 서버(300)의 컨설턴트 관리부(350)에 저장되어 있는 컨설턴트들의 리스트 및 정보를 검색할 수 있도록 하거나, 컨설턴트 관리부(350)에 저장되어 있는 컨설턴트들의 리스트 및 정보가 서비스 이용장치(200)에 제공될 수 있다.The consultant selection step (S22) is a step in which the user selects a consultant who wishes to be evaluated. To this end, a user can access the service server 300 through his/her service using device 200 and search for a list and information of consultants stored in the consultant management unit 350 of the service server 300, or a consultant A list and information of consultants stored in the management unit 350 may be provided to the service using device 200.

결제 단계(S23)는 이용자가 컨설팅 서비스 이용을 위한 비용을 결제하는 단계이다. 이용자는 자신의 서비스 이용장치(200)를 통해 서비스 서버(300)에 접속하여 비용을 결제하거나, 다양한 다른 방식으로 결제를 할 수 있다. 이때, 서비스 서버(300)의 결제부(355)가 결제 정보를 기록 및 관리한다.The payment step S23 is a step in which the user pays for the use of the consulting service. The user may access the service server 300 through the service using device 200 to pay the cost, or pay in various other ways. At this time, the payment unit 355 of the service server 300 records and manages payment information.

평가 단계(S24)는 결제 완료된 이용자의 특정 통화 내용에 대해 이용자에 의해 선택된 컨설턴트가 평가를 수행하는 단계이다. 이용자가 평가를 받기를 원하는 통화 목록과 컨설턴트를 선택하고 결제를 완료하면, 서비스 서버(300)가 해당 컨설턴트에 이를 통지하게 된다. 서비스 서버(300)는 컨설턴트 관리부(350)에 사전에 등록되어 있는 컨설턴트의 연락처(전환 번호, 이메일, SNS 등)로 지정된 컨설턴트에 컨설팅 요청 내용을 통지할 수 있다.The evaluation step S24 is a step in which the consultant selected by the user performs evaluation on the specific currency content of the user who has completed the payment. When the user selects a currency list and a consultant who wants to be evaluated and completes the payment, the service server 300 notifies the consultant. The service server 300 may notify the consultant of the consulting request to the consultant designated as the contact number (conversion number, email, SNS, etc.) of the consultant registered in advance in the consultant management unit 350.

컨설팅을 요청 받은 컨설턴트는 자신의 컨설팅 이용장치(400)를 통해 서비스 서버(300)에 접속하여 이용자의 통화 내용에 대해 평가를 수행할 수 있다. 이를 위해 컨설턴트가 자신의 컨설팅 이용장치(400)를 통해 서비스 서버(300)에 접속하여 이용자의 대화 목록 및 내용을 검색할 수 있도록 하거나, 서비스 서버(300)에 저장되어 있는 이용자의 대화 목록 및 내용이 컨설팅 이용장치(400)에 제공될 수 있다. 컨설턴트가 수행한 평가 결과는 평가 기록부(340)에 기록된다.The consultant who has been requested to consult may access the service server 300 through his/her consulting use device 400 and perform evaluation on the user's call content. To this end, a consultant can access the service server 300 through his/her consulting use device 400 to search the user's conversation list and contents, or the user's conversation list and contents stored in the service server 300 It can be provided to the consulting using device 400. The evaluation results performed by the consultant are recorded in the evaluation recording unit 340.

평가 결과 제공 단계(S25)는 이용자에게 컨설턴트의 평가 결과를 제공하는 단계이다. 서비스 서버(300)는 평가 기록부(340)에 저장된 평가 결과를 해당 이용자의 서비스 이용장치(200)에 제공할 수 있다.The evaluation result providing step (S25) is a step for providing the evaluation result of the consultant to the user. The service server 300 may provide the evaluation result stored in the evaluation recording unit 340 to the user's service using device 200.

이러한 본 발명의 일실시예에 따른 음성인식을 이용한 통화 서비스 방법에 있어서, 대화 내용에 대한 텍스트나 음성의 저장이나, 텍스트 분석, 통화 내용의 평가 등은 서비스 이용자 중 이에 대해 동의한 이용자들에 한해 수행될 수 있다.In the call service method using voice recognition according to an embodiment of the present invention, storage of text or voice for conversation content, text analysis, evaluation of call content, etc., are limited to users who agree to this among service users. Can be performed.

상술한 것과 같이, 본 발명에 따르면, 이용자들이 원격 인터뷰, 원격 회의 등 다양한 목적으로, 음성 통화 또는 영상 통화, 일 대 일 통화, 일 대 다수 통화, 다수 대 다수 통화 등 다양한 형태의 통화를 가능하게 하는 통화 서비스를 제공하면서, 이용자들 간의 대화 내용을 텍스트로 변환하여 실시간으로 이용자들에게 제공할 수 있다. 따라서, 이용자들 간의 더욱 원활하고 정확한 통화를 가능하게 한다.As described above, according to the present invention, users can make various types of calls, such as voice calls or video calls, one-to-one calls, one-to-many calls, many-to-many calls, for various purposes such as remote interviews and teleconferencing. While providing a call service, the conversation between users can be converted into text and provided to users in real time. Therefore, it enables a more smooth and accurate call between users.

또한, 본 발명에 따르면, 통화 내용이 변환되는 텍스트가 저장되고, 저장된 텍스트를 서비스 서버가 분석하여 특정 단어나 문구에 대해 이와 관련한 다른 단어나 문구를 광범위하게 매칭하여 카테고리 별로 분류하고 데이터베이스화 할 수 있다. 이러한 데이터는 다양한 분야예서 유용하게 활용될 수 있는 빅데이터로 제공될 수 있다.In addition, according to the present invention, the text in which the content of the call is converted is stored, and the service server analyzes the stored text to broadly match other words or phrases related to a specific word or phrase and classify and categorize them into categories. have. Such data can be provided as big data that can be usefully utilized in various fields.

또한, 본 발명에 따르면, 서비스 서버가 이용자들 간의 통화가 진행되는 동안, 특정 질문에 대한 예상 답변을 필요한 이용자에게 제공할 수 있다. 따라서, 원격 인터뷰나, 외국어로 진행되는 미팅 중에 답변을 해야 하는 이용자에게 유용하게 이용될 수 있다.In addition, according to the present invention, the service server can provide an expected answer to a specific question to a user in need while a call between users is in progress. Therefore, it can be useful for users who need to answer during a remote interview or a meeting conducted in a foreign language.

또한, 본 발명에 따르면, 이용자들 간의 통화가 진행되는 동안, 서비스 이용장치에 디스플레이되는 텍스트에 대한 번역 텍스트를 제공할 수 있다.Further, according to the present invention, it is possible to provide a translation text for text displayed on the service using apparatus while a call between users is in progress.

또한, 본 발명에 따르면, 이용자가 자신의 통화 내용에 대해 전문적인 컨설턴트로부터 컨설팅을 받을 수 있다. 즉, 이용자는 자신이 과거에 수행했던 원격 인터뷰나, 원격 외국어 미팅 등에 대해 컨설턴트의 평가를 받음으로써, 자신의 부족한 부분에 대한 정보를 얻을 수 있다.In addition, according to the present invention, the user can receive consulting from a professional consultant on the content of his or her call. In other words, the user can obtain information about his/her shortage by receiving a consultant's evaluation on a remote interview or a remote foreign language meeting that he/she conducted in the past.

이상 본 발명에 대해 바람직한 예를 들어 설명하였으나 본 발명의 범위가 앞에서 설명되고 도시되는 형태로 한정되는 것은 아니다.The present invention has been described as a preferred example, but the scope of the present invention is not limited to the form described and illustrated above.

예를 들어, 이용자용 서비스 이용장치(200)나, 컨설팅 이용장치(400)는 도시된 구조로 한정되지 않고 다양하게 변경될 수 있다. 예를 들어, 이용자용 서비스 이용장치(600)는 도 10에 나타낸 것과 같이, 디스플레이부(215)와, 스피커(220)와, 휴대 단말기(610)를 포함할 수 있다. 디스플레이부(215)와, 스피커(220)는 유선 또는 무선 연결 가능한 것이 이용될 수 있다. 이러한 서비스 이용장치(600)는 휴대 단말기(610)의 다양한 기능을 활용하여 통화, 입력, 번역 등의 기능을 수행할 수 있다.For example, the user service use device 200 or the consulting use device 400 is not limited to the illustrated structure and may be variously changed. For example, as shown in FIG. 10, the user service use apparatus 600 may include a display unit 215, a speaker 220, and a portable terminal 610. The display unit 215 and the speaker 220 may be wired or wirelessly connectable. The service using apparatus 600 may perform functions such as call, input, and translation by utilizing various functions of the mobile terminal 610.

또한, 앞서서는 컨설턴트가 이용자의 통화 내용을 평가하기 위해 네트워크(500)를 통해 서비스 서버(300)와 접속할 수 있는 컨설턴트용 컨설팅 이용장치(400)가 별도로 구비되는 것으로 나타냈으나, 컨설팅 이용장치(400)는 생략될 수 있다. 이 경우, 서비스 서버(300)가 컨설턴트가 평가를 수행할 수 있는 인터페이스를 제공하여, 컨설턴트가 서비스 서버(300)를 통해 이용자의 통화 내용을 평가할 수 있다.In addition, previously, it has been shown that the consultant using the consulting device 400 for accessing the service server 300 through the network 500 is separately provided for the consultant to evaluate the call content of the user, but the consulting using device ( 400) may be omitted. In this case, the service server 300 provides an interface through which the consultant can perform evaluation, so that the consultant can evaluate the user's call content through the service server 300.

이상, 본 발명을 본 발명의 원리를 예시하기 위한 바람직한 실시예와 관련하여 도시하고 설명하였으나, 본 발명은 그와 같이 도시되고 설명된 그대로의 구성 및 작용으로 한정되는 것이 아니다. 오히려 첨부된 청구범위의 사상 및 범위를 일탈함이 없이 본 발명에 대한 다수의 변경 및 수정이 가능함을 당업자들은 잘 이해할 수 있을 것이다.Above, the present invention has been shown and described in connection with a preferred embodiment for illustrating the principles of the present invention, but the present invention is not limited to the configuration and operation as shown and described. Rather, those skilled in the art will appreciate that many changes and modifications to the present invention are possible without departing from the spirit and scope of the appended claims.

100 : 음성인식을 이용한 통화 서비스 시스템
200, 600 : 서비스 이용장치 205, 305, 405 : 통신부
210 : 마이크 215, 410 : 디스플레이부
220 : 스피커 225, 360, 425 : 제어부
230, 415 : 입력부 235, 315, 420 : 저장부
240 : 인쇄부 300 : 서비스 서버
310 : STT 변환부 320 : TA 분석부
325 : 분석 결과 DB 330 : 질문 답변 생성부
335 : 변역부 340 : 평가 기록부
345 : 이용자 관리부 350 : 컨설턴트 관리부
355 : 결제부 400 : 컨설팅 이용장치
500 : 네트워크 610 : 휴대 단말기100: call service system using voice recognition
200, 600: service use device 205, 305, 405: communication unit
210: microphone 215, 410: display unit
220: speaker 225, 360, 425: control unit
230, 415: input unit 235, 315, 420: storage unit
240: printing unit 300: service server
310: STT conversion unit 320: TA analysis unit
325: Analysis result DB 330: Question answer generator
335: translation unit 340: evaluation record book
345: User management department 350: Consultant management department
355: payment unit 400: consulting using device
500: network 610: mobile terminal

Claims

A plurality of service use devices for service users; And
It includes; a service server for communicating with the plurality of service use devices through a network to provide a call service to the plurality of service use devices; includes,
The service using device,
A microphone, a speaker, a display unit, a communication unit communicating with the service server via the network, a microphone, the speaker, the display unit, and a control unit electrically connected to the communication unit are provided.
The service server,
A communication unit communicating with the service using device through the network, an STT converting unit converting voice received from the service using device into text, a storage unit storing text converted by the STT converting unit, and the storage A TA analysis unit for analyzing text stored in the unit, an analysis result DB for recording the analysis result of the TA analysis unit, the STT conversion unit, the storage unit, the TA analysis unit, and the analysis result DB And a question answer generator that generates a database by generating a specific question and expected answers matching it from data stored in the analysis result DB.
The text converted by the STT conversion unit is provided to the service using device and displayed in real time on the display unit,
The storage unit stores the voice and other information or data received from the service using device,
When a service user requests his/her call content stored in the storage unit through the service using device, the service server searches for the call content stored in the storage unit and provides the call content requested by the user to the service using device and,
The service server repeats the process of receiving a voice from the plurality of service use devices, converting the received voice into text through the STT converter, and analyzing the text converted by the STT converter into the TA analyzer While distinguishing the questioner and the answerer among the plurality of users, if the text corresponding to the question stored in the question answer generating unit is generated among the text converted by the STT conversion unit, among the expected answers stored in the question answer generating unit A call service system using voice recognition, characterized in that the predicted answer matching the question is provided to the service use device of the answerer among the plurality of service use devices and displayed on the display unit of the service use device.

delete

According to claim 1,
The service server,
A call service using voice recognition, characterized in that it includes an evaluation recording unit that receives a result of the evaluation performed by the consultant who is authorized to evaluate the call contents and evaluates the call contents stored in the storage unit and records the evaluation result of the consultant. system.

The method of claim 4,
The service server,
When the user requests the consultant's evaluation of the content of his or her call, a call service system using voice recognition, characterized in that the user provides the evaluation result stored in the evaluation recording unit.

The method of claim 5,
The service server,
It includes a consultant management unit that can manage the information of authorized consultants to evaluate the contents of calls stored in the storage unit and provide the information to the service using device,
The service use apparatus receives the information of consultants from the consultant management unit and provides an interface through which a user can select a consultant who can be evaluated for the contents of his or her call, a call service system using voice recognition. .

The method of claim 5,
The service server,
Includes a payment unit for payment of service costs,
When the user requests the consultant's evaluation of the contents of his or her call, the voice recognition is characterized in that the user checks the payment through the payment unit and provides the evaluation result stored in the evaluation recording unit to the user only when payment is made. Call service system used.

The method of claim 4,
A communication service system using voice recognition, comprising: a consulting service device that communicates with the service server through the network and provides an interface through which a consultant can perform evaluation on the contents of calls stored in the storage unit. .

According to claim 1,
The service using device provides an interface through which a user can select text displayed on the display unit,
A call service system using speech recognition, characterized in that the display unit displays translated text obtained by translating text selected by a user into another language.

According to claim 1,
The service using device has an input unit,
A call service system using speech recognition, characterized in that the display unit displays translated text translated from another text through the input unit into another language.

(a) a plurality of users each accessing a service server through a network using a service using device;
(b) the service server receiving a voice from the plurality of service using devices;
(c) converting the voice received by the STT converter of the service server into text;
(d) the service server stores text converted by the STT conversion unit, voice received from the service using apparatus, and other information or data in a storage unit provided in the service server, and stores the plurality of services. Providing each to a use device and displaying in real time on a display unit provided in each of the plurality of service use devices;
(e) analyzing a text stored in the storage unit by a TA analysis unit provided in the service server;
(f) the analysis result DB provided in the service server, and records the results analyzed by the TA analysis unit by category;
(g) the step of generating a database by generating a question answer prediction unit matching the specific question from the data stored in the analysis result DB by the question answer generation unit of the service server;
(h) the service server receives voice from the plurality of service use devices, converts the received voice into text through the STT conversion unit, and analyzes the text converted by the STT conversion unit into the TA analysis unit When repeating a process, a questioner and an answerer are distinguished among the plurality of users, and if text corresponding to a question stored in the question answer generator is generated among texts converted by the STT converter, stored in the question answer generator Providing an expected answer matching the corresponding question among the expected answers to the service utilization device of the answerer among the plurality of service utilization devices to be displayed on the display unit of the corresponding service utilization device; And
When a service user requests his/her call content stored in the storage unit through the service using device, the service server searches for the call content stored in the storage unit and provides the call content requested by the user to the service using device Calling method using a voice recognition, characterized in that it comprises a.

delete

The method of claim 11,
After step (d),
(i) an evaluation record unit of the service server, receiving an evaluation result for the call content stored in the storage unit from a consultant authorized to evaluate the call content, and recording the evaluation result; And
(j) When the user requests the evaluation of the consultant for the contents of his/her call to the service server through his/her service using device, the service server retrieves the evaluation result for the user from the data of the evaluation record and applies Providing to the user's service using device; Call service method using voice recognition comprising a.

The method of claim 14,
Before step (i),
(k) storing information of authorized consultants so that the consultant management unit of the service server can evaluate a call, and providing a list and information of the stored consultants to the service using device; And
(l) When a user selects a specific consultant from a list of consultants provided to the service using device and requests evaluation of his/her call content to the service server through his/her service using device, the service server requests the consultant management unit And notifying the contact information of the registered consultant, and receiving an evaluation of the call content of the user requesting the evaluation from the consultant, a call service method using voice recognition.

The method of claim 11,
After step (d),
(m) providing an interface through which a user can select text displayed on the display unit, and displaying translated text for the text selected by the user on the display unit. Call service method used.

The method of claim 11,
(n) receiving text through the service using device, and displaying the translated text in which the text is translated into another language on the display unit.