[go: up one dir, main page]

CN103956167A - Visual sign language interpretation method and device based on Web - Google Patents

Visual sign language interpretation method and device based on Web Download PDF

Info

Publication number
CN103956167A
CN103956167A CN201410188860.2A CN201410188860A CN103956167A CN 103956167 A CN103956167 A CN 103956167A CN 201410188860 A CN201410188860 A CN 201410188860A CN 103956167 A CN103956167 A CN 103956167A
Authority
CN
China
Prior art keywords
sign language
web client
word segmentation
web
text information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410188860.2A
Other languages
Chinese (zh)
Inventor
傅湘玲
江帆
时雨霖
张笑燕
胡婕
刘茂铭
徐畅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Original Assignee
Beijing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications filed Critical Beijing University of Posts and Telecommunications
Priority to CN201410188860.2A priority Critical patent/CN103956167A/en
Publication of CN103956167A publication Critical patent/CN103956167A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

本发明公开了一种基于Web的可视化手语翻译方法和设备。该方法包括:从Web客户端接收语音信号;对该语音信号进行识别,并生成相对应的文本信息;对该文本信息进行分词处理,并得到至少一个分词;从动画库中获取与至少一个分词中的每个分词对应的手语动画,并形成手语动画序列;以及向Web客户端发送手语动画序列,以由该Web客户端按照手语动画序列进行手语播放。由此,可以使得不懂得手语的人群通过输入语音信息就可以便捷地与聋哑人进行交流,而不需要繁琐的手动输入,同时还能满足不懂手语的人学习手语的需求。此外,由于本发明提供的方法和设备是基于Web的,不需要使用第三方软件,因而使得手语翻译可以随时随地在线进行。

The invention discloses a web-based visual sign language translation method and equipment. The method includes: receiving a voice signal from a web client; identifying the voice signal, and generating corresponding text information; performing word segmentation processing on the text information, and obtaining at least one word segmentation; obtaining at least one word segmentation from the animation library The sign language animation corresponding to each participle in , and form the sign language animation sequence; and send the sign language animation sequence to the web client, so that the web client performs sign language playback according to the sign language animation sequence. As a result, people who do not understand sign language can conveniently communicate with deaf-mute people by inputting voice information without cumbersome manual input, and at the same time meet the needs of people who do not understand sign language to learn sign language. In addition, since the method and device provided by the present invention are based on the Web, no third-party software is required, so sign language translation can be performed online anytime and anywhere.

Description

一种基于Web的可视化手语翻译方法及设备A web-based visual sign language translation method and device

技术领域 technical field

本发明涉及手语翻译领域,具体地,涉及一种基于Web的可视化手语翻译方法及设备。  The invention relates to the field of sign language translation, in particular to a Web-based visual sign language translation method and device. the

背景技术 Background technique

在现实生活中,为了便于不懂手语的人群与聋哑人群交流沟通,需要进行手语翻译。然而,现有的手语翻译系统大部分是将手语翻译成语音或文字。这使得不懂手语的人群不能主动方便地通过语音与聋哑人群发起交流,对不懂手语的人群造成了不便。而聋哑人群也无法通过可视化的手语动作来直观地获悉与之交流的人所要表达的信息。此外,由于必须使用手语与这种手语翻译系统进行交互,因而这种手语翻译系统无法满足不懂手语的人群学习手语的需求。  In real life, in order to facilitate communication between people who do not understand sign language and deaf people, sign language interpretation is required. However, most of the existing sign language translation systems translate sign language into speech or text. This prevents people who do not understand sign language from actively and conveniently communicating with deaf people through voice, which causes inconvenience to people who do not understand sign language. And deaf-mute people can't intuitively understand the information that the person communicating with them wants to express through visual sign language movements. In addition, since sign language must be used to interact with the sign language translation system, the sign language translation system cannot meet the needs of people who do not understand sign language to learn sign language. the

还有一些手语翻译系统能够实现手语和语音的双向翻译,但是这种系统是基于第三方客户端软件的。也就是说,在使用前,必须在客户端安装由第三方提供的翻译软件(可能还需要安装相应的数据库)。之后,通过运行该翻译软件来进行手语翻译。由于使用第三方客户端软件,使得这种翻译系统具有非常大的局限性。例如,当客户端的配置不满足安装要求(例如,客户端的存储空间小,不能满足安装相关软件和数据库的空间需求)时,导致无法安装该软件和相应的数据库,也就无法进行手语翻译。或者,当用户使用的客户端发生变化时,用户必须在新的客户端上安装该第三方客户端软件,才能进行手语翻译,这对于用户而言是十分不便的。  There are also some sign language translation systems that can realize two-way translation of sign language and voice, but this system is based on third-party client software. That is to say, before using, the translation software provided by the third party must be installed on the client (and the corresponding database may also need to be installed). After that, sign language translation is performed by running the translation software. Due to the use of third-party client software, this translation system has very large limitations. For example, when the configuration of the client does not meet the installation requirements (for example, the storage space of the client is too small to meet the space requirements for installing related software and databases), the software and the corresponding database cannot be installed, and sign language interpretation cannot be performed. Or, when the client used by the user changes, the user must install the third-party client software on the new client to perform sign language translation, which is very inconvenient for the user. the

发明内容 Contents of the invention

本发明的目的是提供一种便捷、直观的基于Web的可视化手语翻译方法和设备。  The purpose of the present invention is to provide a convenient and intuitive Web-based visual sign language translation method and device. the

为了实现上述目的,本发明提供一种基于Web的可视化手语翻译方法,该方法包括:从Web客户端接收语音信号;对该语音信号进行识别,并生成相对应的文本信息;对该文本信息进行分词处理,并得到至少一个分词;从动画库中获取与所述至少一个分词中的每个分词对应的手语动画,并形成手语动画序列;以及向所述Web客户端发送所述手语动画序列,以由该Web客户端按照所述手语动画序列进行手语播放。  In order to achieve the above object, the present invention provides a Web-based visual sign language translation method, the method comprising: receiving a voice signal from a Web client; identifying the voice signal, and generating corresponding text information; Segmentation processing, and obtain at least one participle; Obtain the sign language animation corresponding to each participle in the at least one participle from the animation library, and form a sign language animation sequence; and send the sign language animation sequence to the Web client, The web client can perform sign language playback according to the sign language animation sequence. the

本发明还提供一种基于Web的可视化手语翻译设备,该设备包括:用于从Web客户端接收语音信号的装置;用于对该语音信号进行识别,并生成相对应的文本信息的装置;用于对该文本信息进行分词处理,并得到至少一个分词的装置;用于从动画库中获取与所述至少一个分词中的每个分词对应的手语动画,并形成手语动画序列的装置;以及用于向所述Web客户端发送所述手语动画序列,以由该Web客户端按照所述手语动画序列进行手语播放的装置。  The present invention also provides a Web-based visual sign language translation device, which includes: a device for receiving a voice signal from a Web client; a device for recognizing the voice signal and generating corresponding text information; A device for performing word segmentation processing on the text information to obtain at least one word segment; a device for obtaining a sign language animation corresponding to each of the at least one word segment from the animation library, and forming a sign language animation sequence; and using A device for sending the sign language animation sequence to the web client, so that the web client performs sign language playback according to the sign language animation sequence. the

通过上述技术方案,可以将语音信息翻译成手语信息。这样,可以使得不懂得手语的人群通过输入语音信息就可以便捷地与聋哑人进行交流,而不需要繁琐的手动输入,同时还能满足不懂手语的人学习手语的需求。另外,由于本发明提供的手语翻译方法和设备是基于Web的,因而使得手语翻译更加简单方便。用户只需要通过Web客户端登录网址,就可以进行手语翻译,不需要使用第三方软件,也就省去了下载和安装第三方软件的繁复过程,避免了第三方软件对客户端的要求限制,使得手语翻译可以随时随地在线进行。  Through the above technical solution, voice information can be translated into sign language information. In this way, people who do not understand sign language can conveniently communicate with deaf-mute people by inputting voice information, without cumbersome manual input, and at the same time meet the needs of people who do not understand sign language to learn sign language. In addition, since the sign language translation method and device provided by the present invention are based on the Web, the sign language translation is simpler and more convenient. Users only need to log in to the website through the web client to perform sign language translation without using third-party software, which saves the complicated process of downloading and installing third-party software, and avoids the restrictions imposed by third-party software on the client. Sign language interpretation can be done online anytime, anywhere. the

本发明的其他特征和优点将在随后的具体实施方式部分予以详细说明。  Other features and advantages of the present invention will be described in detail in the following detailed description. the

附图说明 Description of drawings

附图是用来提供对本发明的进一步理解,并且构成说明书的一部分,与下面的具体实施方式一起用于解释本发明,但并不构成对本发明的限制。在附图中:  The accompanying drawings are used to provide a further understanding of the present invention, and constitute a part of the description, together with the following specific embodiments, are used to explain the present invention, but do not constitute a limitation to the present invention. In the attached picture:

图1是根据本发明的实施方式的基于Web的可视化手语翻译方法的流程图。  FIG. 1 is a flowchart of a Web-based visual sign language translation method according to an embodiment of the present invention. the

具体实施方式 Detailed ways

以下结合附图对本发明的具体实施方式进行详细说明。应当理解的是,此处所描述的具体实施方式仅用于说明和解释本发明,并不用于限制本发明。  Specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings. It should be understood that the specific embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention. the

图1示出了根据本发明的实施方式的基于Web的可视化手语翻译方法的流程图。如图1所示,该方法可以包括:步骤S101,从Web客户端接收语音信号;步骤S102,对该语音信号进行识别,并生成相对应的文本信息(例如,该文本信息是基于普通话标准的);步骤S103,对该文本信息进行分词处理,并得到至少一个分词;步骤S104,从动画库中获取与所述至少一个分词中的每个分词对应的手语动画,并形成手语动画序列;以及步骤S105,向所述Web客户端发送所述手语动画序列,以由该Web客户端按照所述手语动画序列进行手语播放。  Fig. 1 shows a flowchart of a Web-based visual sign language translation method according to an embodiment of the present invention. As shown in Figure 1, the method may include: step S101, receiving a voice signal from a Web client; step S102, recognizing the voice signal, and generating corresponding text information (for example, the text information is based on the Mandarin standard ); step S103, carry out word segmentation processing to this text information, and obtain at least one word; Step S104, obtain the sign language animation corresponding to each word in the at least one word part from the animation storehouse, and form the sign language animation sequence; And Step S105, sending the sign language animation sequence to the web client, so that the web client can perform sign language playback according to the sign language animation sequence. the

所述Web客户端可以例如是浏览器。若用户想要进行语音-手语翻译的应用,该用户只需在浏览器上输入相应网址即可。在登录到该网址后,用户可通过麦克风等语音输入设备来向该Web客户端输入语音信号。之后,通过本发明提供的手语翻译方法就可以实现语音-手语的翻译。  The web client can be, for example, a browser. If the user wants to implement the speech-sign language translation application, the user only needs to input the corresponding URL on the browser. After logging in to the website, the user can input a voice signal to the Web client through a voice input device such as a microphone. Afterwards, the speech-sign language translation can be realized through the sign language translation method provided by the present invention. the

在本发明提供的手语翻译方法中,首先,从Web客户端接收语音信号(即,用户向Web客户端输入的语音信息)。之后,对该语音信号进行识别, 并生成相对应的文本信息。  In the sign language interpretation method provided by the present invention, firstly, a voice signal (that is, voice information input by the user to the Web client) is received from the Web client. Afterwards, the voice signal is recognized, and corresponding text information is generated. the

在本发明的一个实施方式中,可以利用语音识别引擎来进行语音信号的识别。例如,该语音识别引擎可以是谷歌语音识别接口(Google Speech Recognition Interface)。这种语音识别引擎的优点在于识别精度高,调用方法简单,并且可以支持多语种。可替换地,该语音识别引擎可以是科大讯飞语音识别应用程序接口(API)。应当注意的是,除了以上列出的两个示例,在本发明中,其他类型的语音识别引擎也可以被使用。此外,用户还可以根据需要来方便地任意更换语音识别引擎,或增加语音识别引擎来支持多语言。  In one embodiment of the present invention, a speech recognition engine may be used to recognize speech signals. For example, the speech recognition engine may be Google Speech Recognition Interface (Google Speech Recognition Interface). The advantages of this speech recognition engine are high recognition accuracy, simple calling method, and multilingual support. Alternatively, the speech recognition engine may be iFLYTEK speech recognition application program interface (API). It should be noted that in addition to the two examples listed above, other types of speech recognition engines may also be used in the present invention. In addition, users can easily replace the speech recognition engine as needed, or add speech recognition engines to support multiple languages. the

在对语音信号进行识别之后,可以生成与该语音信号对应的文本信息。在得到所述文本信息之后,可以对该文本信息进行分词处理。在本发明中,可采用本领域的技术人员公知的分词处理方法来对文本信息进行分词。在分词之后,可以得到至少一个分词。例如,经识别得到的文本信息为“我想吃饭”,那么,对该文本信息进行分词处理之后,可以得到三个分词,分别为“我”、“想”、“吃饭”。  After the voice signal is recognized, text information corresponding to the voice signal can be generated. After the text information is obtained, word segmentation processing may be performed on the text information. In the present invention, word segmentation processing methods known to those skilled in the art can be used to segment text information. After the word segmentation, at least one word segmentation can be obtained. For example, if the recognized text information is "I want to eat", then, after word segmentation processing is performed on the text information, three word segments can be obtained, namely "I", "want", and "eat". the

在得到至少一个分词之后,下一步,从动画库中获取与所述至少一个分词中的每个分词对应的手语动画,并形成手语动画序列。所述动画库中可以存储有大量词语和与每个词语对应的手语动画。在得到所述至少一个分词之后,可以根据所述至少一个分词,来从动画库中提取出与每个分词对应的手语动画。例如,假设三个分词分别为“我”、“想”、“吃饭”。那么,可以根据这三个分词,从动画库中提取出与分词“我”对应的手语动画,与“想”对应的手语动画,以及与“吃饭”对应的手语动画。在提取出相应的手语动画之后,可以按照分词顺序将这些手语动画组成手语动画序列。  After obtaining at least one participle, the next step is to acquire the sign language animation corresponding to each participle in the at least one participle from the animation library, and form a sign language animation sequence. A large number of words and sign language animations corresponding to each word may be stored in the animation library. After the at least one participle is obtained, the sign language animation corresponding to each participle may be extracted from the animation library according to the at least one participle. For example, suppose the three participles are "I", "want", and "eat" respectively. Then, according to these three participles, the sign language animation corresponding to the participle "I", the sign language animation corresponding to "want", and the sign language animation corresponding to "eating" can be extracted from the animation library. After the corresponding sign language animations are extracted, these sign language animations can be combined into a sign language animation sequence according to the sequence of word segmentation. the

下一步,将所述手语动画序列发送至所述Web客户端,以由该Web客户端按照所述手语动画序列进行手语播放。  In the next step, the sign language animation sequence is sent to the web client, so that the web client performs sign language playback according to the sign language animation sequence. the

在所述Web客户端中可以使用3D播放器(例如,unity3d web player) 来进行手语播放。在这种情况下,本发明提供的手语翻译方法还包括:在从所述Web客户端接收语音信号之前,将用于播放手语动画的3D模型加载至所述Web客户端的步骤。  A 3D player (for example, unity3d web player) can be used in the web client to play sign language. In this case, the sign language translation method provided by the present invention further includes: before receiving the voice signal from the Web client, loading the 3D model used for playing the sign language animation to the Web client. the

在将3D模型加载至Web客户端之后,用户可在该Web客户端上观看到一3D模型。例如,该3D模型可以是一3D虚拟人。该Web客户端在接收到手语动画序列之后,可以将该手语动画序列载入至其上的播放器。之后,该播放器可以自动解析该手语动画序列,并按照该手语动画序列来控制3D虚拟人动作。这样,用户就可以通过该3D虚拟人来可视化地观看手语动作。  After the 3D model is loaded to the web client, the user can view a 3D model on the web client. For example, the 3D model can be a 3D virtual human. After the web client receives the sign language animation sequence, it can load the sign language animation sequence into its player. Afterwards, the player can automatically analyze the sign language animation sequence, and control the 3D virtual human action according to the sign language animation sequence. In this way, the user can visually watch sign language actions through the 3D virtual human. the

除了利用3D播放器来展示3D手语播放,也可以使用下一代网络技术中的网页3D技术来实现在Web客户端上展示3D手语播放。  In addition to using a 3D player to display 3D sign language playback, the webpage 3D technology in the next generation network technology can also be used to display 3D sign language playback on a Web client. the

由此,通过上述技术方案,可以将语音信息翻译成手语信息。这样,可以使得不懂得手语的人群通过输入语音信息就可以便捷地与聋哑人进行交流,而不需要繁琐的手动输入,同时还能满足不懂手语的人学习手语的需求。另外,由于本发明提供的手语翻译方法是基于Web的,因而使得手语翻译更加简单方便。用户只需要通过Web客户端登录服务器网址,就可以进行手语翻译,不需要使用第三方软件,也就省去了下载和安装第三方软件的繁复过程,避免了第三方软件对客户端的要求限制,使得手语翻译可以随时随地在线进行。  Thus, through the above technical solution, voice information can be translated into sign language information. In this way, people who do not understand sign language can conveniently communicate with deaf-mute people by inputting voice information without cumbersome manual input, and at the same time meet the needs of people who do not understand sign language to learn sign language. In addition, since the sign language translation method provided by the present invention is based on the Web, the sign language translation is simpler and more convenient. Users only need to log in to the server website through the web client to perform sign language translation without using third-party software, which saves the complicated process of downloading and installing third-party software, and avoids the requirements of third-party software on the client. This enables sign language interpretation to be performed online anytime, anywhere. the

在Web客户端接收到用户输入的语音信号之后,该语音信号可由该Web客户端进行播放。具体地,该Web客户端可包括一语音合成模块。在Web客户端接收到用户输入的语音信号之后,该Web客户端可以通过所述语音合成模块将所述语音信号合成为一音频文件。之后,可将该音频文件传输至播放器进行播放。优选地,播放器同步播放手语动画和所述音频文件。这样,用户可在观看手语动作的同时,还能够收听相对应的语音。通过声形并茂地进行手语展示,有利于提高不懂手语的人群进行手语学习的效果。  After the web client receives the voice signal input by the user, the voice signal can be played by the web client. Specifically, the Web client may include a speech synthesis module. After the web client receives the speech signal input by the user, the web client can synthesize the speech signal into an audio file through the speech synthesis module. Afterwards, the audio file can be transferred to a player for playback. Preferably, the player synchronously plays the sign language animation and the audio file. In this way, the user can listen to the corresponding voice while watching the sign language action. The display of sign language through sound and form is beneficial to improve the effect of sign language learning for people who do not understand sign language. the

在本发明的另一实施方式中,所述方法还可以包括:向所述Web客户端发送所述文本信息,以由该Web客户端显示所述文本信息。通过这一方式,可以使得用户在观看手语动作的同时,还能够同步看到文字(可被称为“字幕”)。这样,可以为聋哑人群多提供一种交流信息获取方式。  In another embodiment of the present invention, the method may further include: sending the text information to the Web client, so that the Web client can display the text information. In this way, the user can simultaneously see the text (which may be called "subtitles") while watching the gestures in sign language. In this way, one more way of obtaining communication information can be provided for deaf-mute people. the

在本发明的一个优选的实施方式中,所述分词处理的步骤还可以包括对分词进行优化处理,并且所得到的至少一个分词是经优化处理后的分词。其中,所述优化处理可以包括以下操作中的至少一者:去除、替换、顺序调换。  In a preferred embodiment of the present invention, the word segmentation processing step may also include optimizing the word segmentation, and the obtained at least one word segmentation is an optimized word segmentation. Wherein, the optimization process may include at least one of the following operations: removal, replacement, and order exchange. the

例如,在得到文本信息之后,可以先对该文本信息进行分词,之后,利用手语词库中存储的手语词汇,来对这些分词进行筛选。这样做的目的在于要从这些分词中去除掉手语词库中没有的分词(即,手语表达中没有的分词)。在去除掉手语表达中没有的分词之后,还可以对剩下的分词进行替换和/或顺序调换,从而使得手语翻译结果更加符合手语的语法和规范。在经过上述优化处理之后,可以根据经优化处理后得到的至少一个分词来从动画库中获取与所述至少一个分词中的每个分词对应的手语动画。  For example, after the text information is obtained, the text information can be segmented into words first, and then these word segments can be screened by using the sign language vocabulary stored in the sign language thesaurus. The purpose of doing this is to remove the participle not in the sign language lexicon (that is, the participle not in the sign language expression) from these participle words. After removing the participle that is not in the sign language expression, the remaining participle can be replaced and/or their order changed, so that the sign language translation result is more in line with the grammar and norms of the sign language. After the above optimization process, the sign language animation corresponding to each participle in the at least one participle may be obtained from the animation library according to the at least one participle obtained after the optimization process. the

本发明还提供一种基于Web的可视化手语翻译设备,该设备可以包括:用于从Web客户端接收语音信号的装置;用于对该语音信号进行识别,并生成相对应的文本信息的装置;用于对该文本信息进行分词处理,并得到至少一个分词的装置;用于从动画库中获取与所述至少一个分词中的每个分词对应的手语动画,并形成手语动画序列的装置;以及用于向所述Web客户端发送所述手语动画序列,以由该Web客户端按照所述手语动画序列进行手语播放的装置。  The present invention also provides a Web-based visual sign language translation device, which may include: a device for receiving a voice signal from a Web client; a device for recognizing the voice signal and generating corresponding text information; A device for segmenting the text information and obtaining at least one segment; a device for obtaining a sign language animation corresponding to each segment in the at least one segment from the animation library, and forming a sequence of sign language animations; and A device for sending the sign language animation sequence to the web client, so that the web client performs sign language playback according to the sign language animation sequence. the

其中,所述语音信号可以通过使用语音识别引擎来被识别。此外,所述语音信号还能够在由所述Web客户端按照所述手语动画序列进行手语播放的同时,由该Web客户端进行播放。  Wherein, the speech signal can be recognized by using a speech recognition engine. In addition, the voice signal can also be played by the Web client while the Web client performs sign language playback according to the sign language animation sequence. the

在另一实施方式中,该设备还可以包括:用于向所述Web客户端发送 所述文本信息,以由该Web客户端显示所述文本信息的装置。  In another embodiment, the device may also include: means for sending the text information to the Web client, so that the Web client displays the text information. the

在另一实施方式中,所述分词处理可以包括对分词进行优化处理,并且所得到的至少一个分词是经优化处理后的分词,其中,所述优化处理包括以下操作中的至少一者:去除、替换、顺序调换。  In another embodiment, the word segmentation processing may include optimizing the word segmentation, and the obtained at least one word segmentation is an optimized word segment, wherein the optimization processing includes at least one of the following operations: removing , replacement, order exchange. the

在另一实施方式中,该设备还可以包括:用于在从所述Web客户端接收语音信号之前,将用于播放手语动画的3D模型加载至所述Web客户端的装置。  In another implementation manner, the device may further include: means for loading the 3D model for playing sign language animation to the web client before receiving the voice signal from the web client. the

由此,通过本发明提供的基于Web的可视化手语翻译方法和设备,可以将语音信息翻译成手语信息。这样,可以使得不懂得手语的人群通过输入语音信息就可以便捷地与聋哑人进行交流,而不需要繁琐的手动输入,同时还能满足不懂手语的人学习手语的需求。另外,由于本发明提供的手语翻译方法和设备是基于Web的,因而使得手语翻译更加简单方便。用户只需要通过Web客户端登录服务器网址,就可以进行手语翻译,不需要使用第三方软件,也就省去了下载和安装第三方软件的繁复过程,避免了第三方软件对客户端的要求限制,使得手语翻译可以随时随地在线进行。  Thus, voice information can be translated into sign language information through the Web-based visual sign language translation method and device provided by the present invention. In this way, people who do not understand sign language can conveniently communicate with deaf-mute people by inputting voice information without cumbersome manual input, and at the same time meet the needs of people who do not understand sign language to learn sign language. In addition, since the sign language translation method and device provided by the present invention are based on the Web, the sign language translation is simpler and more convenient. Users only need to log in to the server website through the web client to perform sign language translation without using third-party software, which saves the complicated process of downloading and installing third-party software, and avoids the requirements of third-party software on the client. This enables sign language interpretation to be performed online anytime, anywhere. the

以上结合附图详细描述了本发明的优选实施方式,但是,本发明并不限于上述实施方式中的具体细节,在本发明的技术构思范围内,可以对本发明的技术方案进行多种简单变型,这些简单变型均属于本发明的保护范围。  The preferred embodiment of the present invention has been described in detail above in conjunction with the accompanying drawings, but the present invention is not limited to the specific details of the above embodiment, within the scope of the technical concept of the present invention, various simple modifications can be made to the technical solution of the present invention, These simple modifications all belong to the protection scope of the present invention. the

另外需要说明的是,在上述具体实施方式中所描述的各个具体技术特征,在不矛盾的情况下,可以通过任何合适的方式进行组合。为了避免不必要的重复,本发明对各种可能的组合方式不再另行说明。  In addition, it should be noted that the various specific technical features described in the above specific implementation manners may be combined in any suitable manner if there is no contradiction. In order to avoid unnecessary repetition, various possible combinations are not further described in the present invention. the

此外,本发明的各种不同的实施方式之间也可以进行任意组合,只要其不违背本发明的思想,其同样应当视为本发明所公开的内容。  In addition, various combinations of different embodiments of the present invention can also be combined arbitrarily, as long as they do not violate the idea of the present invention, they should also be regarded as the disclosed content of the present invention. the

Claims (12)

1.一种基于Web的可视化手语翻译方法,其特征在于,该方法包括:1. A Web-based visual sign language translation method, characterized in that the method comprises: 从Web客户端接收语音信号;Receive voice signal from web client; 对该语音信号进行识别,并生成相对应的文本信息;Recognize the voice signal and generate corresponding text information; 对该文本信息进行分词处理,并得到至少一个分词;performing word segmentation processing on the text information, and obtaining at least one word segmentation; 从动画库中获取与所述至少一个分词中的每个分词对应的手语动画,并形成手语动画序列;以及Acquiring the sign language animation corresponding to each participle in the at least one participle from the animation library, and forming a sign language animation sequence; and 向所述Web客户端发送所述手语动画序列,以由该Web客户端按照所述手语动画序列进行手语播放。Sending the sign language animation sequence to the Web client, so that the Web client performs sign language playback according to the sign language animation sequence. 2.根据权利要求1所述的方法,其特征在于,所述语音信号是通过使用语音识别引擎来被识别的。2. The method of claim 1, wherein the speech signal is recognized using a speech recognition engine. 3.根据权利要求1所述的方法,其特征在于,所述语音信号能够在由所述Web客户端按照所述手语动画序列进行手语播放的同时,由该Web客户端进行播放。3. The method according to claim 1, wherein the voice signal can be played by the Web client while the Web client performs sign language playback according to the sign language animation sequence. 4.根据权利要求1所述的方法,其特征在于,该方法还包括:向所述Web客户端发送所述文本信息,以由该Web客户端显示所述文本信息。4. The method according to claim 1, further comprising: sending the text information to the Web client, so that the Web client can display the text information. 5.根据权利要求1-4中任一权利要求所述的方法,其特征在于,所述分词处理包括对分词进行优化处理,并且所得到的至少一个分词是经优化处理后的分词,其中,所述优化处理包括以下操作中的至少一者:去除、替换、顺序调换。5. The method according to any one of claims 1-4, wherein the word segmentation processing includes optimizing the word segmentation, and the obtained at least one word segmentation is an optimized word segmentation, wherein, The optimization process includes at least one of the following operations: removal, replacement, and order exchange. 6.根据权利要求1-4中任一权利要求所述的方法,其特征在于,该方法还包括:在从所述Web客户端接收语音信号之前,将用于播放手语动画的3D模型加载至所述Web客户端。6. The method according to any one of claims 1-4, further comprising: before receiving the voice signal from the Web client, loading the 3D model for playing sign language animation to The web client. 7.一种基于Web的可视化手语翻译设备,其特征在于,该设备包括:7. A Web-based visual sign language translation device, characterized in that the device includes: 用于从Web客户端接收语音信号的装置;means for receiving voice signals from a web client; 用于对该语音信号进行识别,并生成相对应的文本信息的装置;A device for recognizing the voice signal and generating corresponding text information; 用于对该文本信息进行分词处理,并得到至少一个分词的装置;A device for performing word segmentation processing on the text information and obtaining at least one word segmentation; 用于从动画库中获取与所述至少一个分词中的每个分词对应的手语动画,并形成手语动画序列的装置;以及A device for obtaining a sign language animation corresponding to each participle in the at least one participle from the animation library, and forming a sign language animation sequence; and 用于向所述Web客户端发送所述手语动画序列,以由该Web客户端按照所述手语动画序列进行手语播放的装置。A device for sending the sign language animation sequence to the web client, so that the web client performs sign language playback according to the sign language animation sequence. 8.根据权利要求7所述的设备,其特征在于,所述语音信号是通过使用语音识别引擎来被识别的。8. The device of claim 7, wherein the speech signal is recognized using a speech recognition engine. 9.根据权利要求7所述的设备,其特征在于,所述语音信号能够在由所述Web客户端按照所述手语动画序列进行手语播放的同时,由该Web客户端进行播放。9 . The device according to claim 7 , wherein the voice signal can be played by the Web client while the Web client performs sign language playback according to the sign language animation sequence. 10.根据权利要求7所述的设备,其特征在于,该设备还包括:用于向所述Web客户端发送所述文本信息,以由该Web客户端显示所述文本信息的装置。10. The device according to claim 7, further comprising: means for sending the text information to the Web client, so that the Web client can display the text information. 11.根据权利要求7-10中任一权利要求所述的设备,其特征在于,所述分词处理包括对分词进行优化处理,并且所得到的至少一个分词是经优化处理后的分词,其中,所述优化处理包括以下操作中的至少一者:去除、替换、顺序调换。11. The device according to any one of claims 7-10, wherein the word segmentation processing includes optimizing the word segmentation, and the obtained at least one word segmentation is an optimized word segmentation, wherein, The optimization process includes at least one of the following operations: removal, replacement, and order exchange. 12.根据权利要求7-10中任一权利要求所述的设备,其特征在于,该设备还包括:用于在从所述Web客户端接收语音信号之前,将用于播放手语动画的3D模型加载至所述Web客户端的装置。12. The device according to any one of claims 7-10, further comprising: a 3D model for playing sign language animation before receiving the voice signal from the Web client A device loaded into the web client.
CN201410188860.2A 2014-05-06 2014-05-06 Visual sign language interpretation method and device based on Web Pending CN103956167A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410188860.2A CN103956167A (en) 2014-05-06 2014-05-06 Visual sign language interpretation method and device based on Web

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410188860.2A CN103956167A (en) 2014-05-06 2014-05-06 Visual sign language interpretation method and device based on Web

Publications (1)

Publication Number Publication Date
CN103956167A true CN103956167A (en) 2014-07-30

Family

ID=51333433

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410188860.2A Pending CN103956167A (en) 2014-05-06 2014-05-06 Visual sign language interpretation method and device based on Web

Country Status (1)

Country Link
CN (1) CN103956167A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462074A (en) * 2014-12-26 2015-03-25 北京奇虎科技有限公司 Method and device for conducting webpage data translation and browser client side
WO2017161741A1 (en) * 2016-03-23 2017-09-28 乐视控股(北京)有限公司 Method and device for communicating information with deaf-mutes, smart terminal
CN107562738A (en) * 2017-09-08 2018-01-09 合肥安华信息科技有限公司 A kind of sign language interpretation method based on user's request
CN107610205A (en) * 2017-09-20 2018-01-19 珠海金山网络游戏科技有限公司 Webpage input audio is generated to the methods, devices and systems of mouth shape cartoon based on HTML5
CN107798964A (en) * 2017-11-24 2018-03-13 郑军 The sign language intelligent interaction device and its exchange method of a kind of Real time identification gesture
CN108803871A (en) * 2018-05-07 2018-11-13 歌尔科技有限公司 It wears the output method of data content, device in display equipment and wears display equipment
CN109166409A (en) * 2018-10-10 2019-01-08 长沙千博信息技术有限公司 A kind of sign language conversion method and device
CN109409255A (en) * 2018-10-10 2019-03-01 长沙千博信息技术有限公司 A kind of sign language scene generating method and device
CN110598576A (en) * 2019-08-21 2019-12-20 腾讯科技(深圳)有限公司 Sign language interaction method and device and computer medium
CN110890097A (en) * 2019-11-21 2020-03-17 京东数字科技控股有限公司 Voice processing method and device, computer storage medium and electronic equipment
CN111090998A (en) * 2018-10-18 2020-05-01 北京搜狗科技发展有限公司 A sign language conversion method, device and device for sign language conversion
CN111857934A (en) * 2020-07-29 2020-10-30 香港乐蜜有限公司 A page loading method, device, electronic device and storage medium
CN113706977A (en) * 2020-08-13 2021-11-26 苏州韵果莘莘影视科技有限公司 Playing method and system based on intelligent sign language translation software
CN114708367A (en) * 2022-03-28 2022-07-05 长沙千博信息技术有限公司 Sign language digital human driver and real-time rendering system based on WebGL

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462074A (en) * 2014-12-26 2015-03-25 北京奇虎科技有限公司 Method and device for conducting webpage data translation and browser client side
CN104462074B (en) * 2014-12-26 2018-04-10 北京奇虎科技有限公司 A kind of method, apparatus and browser client for carrying out web data translation
WO2017161741A1 (en) * 2016-03-23 2017-09-28 乐视控股(北京)有限公司 Method and device for communicating information with deaf-mutes, smart terminal
CN107562738A (en) * 2017-09-08 2018-01-09 合肥安华信息科技有限公司 A kind of sign language interpretation method based on user's request
CN107610205A (en) * 2017-09-20 2018-01-19 珠海金山网络游戏科技有限公司 Webpage input audio is generated to the methods, devices and systems of mouth shape cartoon based on HTML5
CN107798964A (en) * 2017-11-24 2018-03-13 郑军 The sign language intelligent interaction device and its exchange method of a kind of Real time identification gesture
CN108803871A (en) * 2018-05-07 2018-11-13 歌尔科技有限公司 It wears the output method of data content, device in display equipment and wears display equipment
CN109166409A (en) * 2018-10-10 2019-01-08 长沙千博信息技术有限公司 A kind of sign language conversion method and device
CN109409255A (en) * 2018-10-10 2019-03-01 长沙千博信息技术有限公司 A kind of sign language scene generating method and device
CN111090998A (en) * 2018-10-18 2020-05-01 北京搜狗科技发展有限公司 A sign language conversion method, device and device for sign language conversion
CN110598576A (en) * 2019-08-21 2019-12-20 腾讯科技(深圳)有限公司 Sign language interaction method and device and computer medium
CN110890097A (en) * 2019-11-21 2020-03-17 京东数字科技控股有限公司 Voice processing method and device, computer storage medium and electronic equipment
CN111857934A (en) * 2020-07-29 2020-10-30 香港乐蜜有限公司 A page loading method, device, electronic device and storage medium
CN113706977A (en) * 2020-08-13 2021-11-26 苏州韵果莘莘影视科技有限公司 Playing method and system based on intelligent sign language translation software
CN114708367A (en) * 2022-03-28 2022-07-05 长沙千博信息技术有限公司 Sign language digital human driver and real-time rendering system based on WebGL

Similar Documents

Publication Publication Date Title
CN103956167A (en) Visual sign language interpretation method and device based on Web
US10614803B2 (en) Wake-on-voice method, terminal and storage medium
US20190311709A1 (en) Computerized system and method for formatted transcription of multimedia content
CN107464555B (en) Method, computing device and medium for enhancing audio data including speech
CN112309365B (en) Training method and device of speech synthesis model, storage medium and electronic equipment
US10950254B2 (en) Producing comprehensible subtitles and captions for an effective group viewing experience
US10991380B2 (en) Generating visual closed caption for sign language
US11176141B2 (en) Preserving emotion of user input
US9047868B1 (en) Language model data collection
US20140358516A1 (en) Real-time, bi-directional translation
CN104681023A (en) Information processing method and electronic equipment
JP6233798B2 (en) Apparatus and method for converting data
CN105426362A (en) Speech Translation Apparatus And Method
CN109256133A (en) A kind of voice interactive method, device, equipment and storage medium
TWI509432B (en) Electronic device and language analysis method thereof
US20190325067A1 (en) Generating descriptive text contemporaneous to visual media
CN109241286A (en) Method and apparatus for generating text
CN111142667A (en) System and method for generating voice based on text mark
WO2021227308A1 (en) Video resource generation method and apparatus
CN107705782A (en) Method and apparatus for determining phoneme pronunciation duration
KR101385316B1 (en) System and method for providing conversation service connected with advertisements and contents using robot
CN114064943A (en) Conference management method, conference management device, storage medium and electronic equipment
CN118689347A (en) Intelligent agent generation method, interaction method, device, medium and equipment
CN110379406B (en) Voice comment conversion method, system, medium and electronic device
KR20210050410A (en) Method and system for suppoting content editing based on real time generation of synthesized sound for video content

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140730