[go: up one dir, main page]

CN112562639B - Audio processing method, terminal and computer readable storage medium - Google Patents

Audio processing method, terminal and computer readable storage medium Download PDF

Info

Publication number
CN112562639B
CN112562639B CN202011373395.1A CN202011373395A CN112562639B CN 112562639 B CN112562639 B CN 112562639B CN 202011373395 A CN202011373395 A CN 202011373395A CN 112562639 B CN112562639 B CN 112562639B
Authority
CN
China
Prior art keywords
terminal
audio information
audio
information
preset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011373395.1A
Other languages
Chinese (zh)
Other versions
CN112562639A (en
Inventor
常磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nubia Technology Co Ltd
Original Assignee
Nubia Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nubia Technology Co Ltd filed Critical Nubia Technology Co Ltd
Priority to CN202011373395.1A priority Critical patent/CN112562639B/en
Publication of CN112562639A publication Critical patent/CN112562639A/en
Application granted granted Critical
Publication of CN112562639B publication Critical patent/CN112562639B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G10L13/047Architecture of speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/013Adapting to target pitch
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/14Session management
    • H04L67/141Setup of application sessions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/16Implementation or adaptation of Internet protocol [IP], of transmission control protocol [TCP] or of user datagram protocol [UDP]
    • H04L69/161Implementation details of TCP/IP or UDP/IP stack architecture; Specification of modified or new header fields
    • H04L69/162Implementation details of TCP/IP or UDP/IP stack architecture; Specification of modified or new header fields involving adaptations of sockets based mechanisms

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Security & Cryptography (AREA)
  • Telephone Function (AREA)

Abstract

The application provides an audio processing method which is applied to a first terminal, wherein the first terminal comprises an audio processing module, and the audio processing method comprises the following steps: the first terminal acquires audio information, and performs preset processing on the audio information through the audio processing module to obtain target audio information; the first terminal sends the target audio information to a second terminal so that the second terminal can perform preset operation on the target audio information; and the second terminal and the first terminal establish communication connection through a preset communication protocol. The application also provides a terminal and a computer readable storage medium. The audio processing module of the first terminal is utilized to perform preset processing on the audio information, so that the target audio information obtained through the preset processing has a better effect, and the second terminal can obtain the target audio information and perform preset operation on the target audio information.

Description

Audio processing method, terminal and computer readable storage medium
Technical Field
The present application relates to the field of information processing technologies, and in particular, to an audio processing method, a terminal, and a computer readable storage medium.
Background
At present, in the process of live broadcasting room and convection and teammate communication of game application, a mobile terminal such as a mobile phone generally obtains microphone audio information, synthesizes and sound-changing processing is carried out on the audio information, and then the audio information is sent to the live broadcasting room. However, the mobile phone is limited by the microphone device and the sound-changing effect of the hardware device for processing the audio information, and cannot be compared with a terminal such as a computer and an external microphone device which are mature for decades. The effects of synthesizing and sound-changing processing on the audio information are much worse, so that the sound of the live broadcasting room is often quite noisy, the sound-changing effect is general and not lifelike, and the use experience of a user is affected.
Disclosure of Invention
The application mainly aims to provide an audio processing method, a terminal and a computer readable storage medium, which aim to solve the problem that microphone equipment and hardware equipment of a mobile phone per se are poor in audio information processing effect.
In order to achieve the above object, the present application provides an audio processing method applied to a first terminal, the first terminal including an audio processing module, the audio processing method including:
the first terminal acquires audio information, and performs preset processing on the audio information through the audio processing module to obtain target audio information;
The first terminal sends the target audio information to a second terminal so that the second terminal can perform preset operation on the target audio information;
and the second terminal and the first terminal establish communication connection through a preset communication protocol.
Optionally, the standard dual camera includes a first camera and a second camera, the first terminal is in communication connection with an audio device, the audio device includes a microphone, and the audio processing method further includes:
the first terminal judges whether to establish screen projection connection with the second terminal;
and if the second terminal has established screen projection connection with the first terminal, the first terminal controls the microphone to acquire the audio information.
Optionally, the step of acquiring the audio information by the first terminal includes:
the audio device sends the collected audio information to the first terminal, wherein the audio information and the target audio information are both audio data in a pulse code modulation format.
Optionally, the audio processing method further includes:
the second terminal acquires first identity identification information of the first terminal so as to send screen projection information to the first terminal based on the first identity identification information;
The first terminal obtains second identification information of the second terminal so as to send the target audio information to the second terminal based on the second identification information.
Optionally, the preset communication protocol is a Socket protocol, and the first terminal receives and synchronously plays the screen-throwing information.
Optionally, the step of performing preset processing on the audio information includes:
and sequentially performing sound changing processing and noise reduction processing on the audio information based on a preset algorithm.
Optionally, the step of sequentially performing the sound modification processing on the audio information includes:
changing sound attributes in the audio information; and/or
Changing the dialect type in the audio information; and/or
Changing languages in the audio information; and/or
Changing the current background sound of the audio information;
wherein the sound attribute includes at least one of a tone and a timbre.
Optionally, the step of performing a preset operation on the target audio information by the second terminal includes:
and the second terminal sends the target audio information to a preset application.
The application also provides a terminal, which comprises: a touch screen; a processor; and the memory is connected with the processor and contains control instructions, and when the processor reads the control instructions, the terminal is controlled to realize the audio processing method.
The present application also provides a computer-readable storage medium having one or more programs executed by one or more processors to implement the above-described audio processing method.
According to the audio processing method, the terminal and the computer readable storage medium, a first terminal acquires audio information, and preset processing is carried out on the audio information through an audio processing module of the first terminal to obtain target audio information; the first terminal sends the target audio information to a second terminal so that the second terminal can perform preset operation on the target audio information; the second terminal and the first terminal establish communication connection through a preset communication protocol, so that the audio information can be subjected to preset processing by using an audio processing module of the first terminal, the target audio information obtained through the preset processing has better effect, and the second terminal can acquire the target audio information and perform preset operation on the target audio information; further, the first terminal is in communication connection with an audio device, the audio device comprises a microphone, and the first terminal can judge whether to establish screen projection connection with the second terminal; if the second terminal and the first terminal are connected in a screen throwing way, the first terminal controls the microphone to collect the audio information, so that the microphone of the external audio equipment collects the audio information, the effect optimization of the collected audio information is realized, and the use experience of a user is improved.
The foregoing description is only an overview of the present application, and is intended to be implemented in accordance with the teachings of the present application in order that the same may be more clearly understood and to make the same and other objects, features and advantages of the present application more readily apparent.
Drawings
FIG. 1 is a schematic diagram of the hardware architecture of an alternative terminal for implementing various embodiments of the present application;
fig. 2 is a schematic diagram of a communication network system of the terminal shown in fig. 1;
FIG. 3 is a flowchart of an audio processing method according to an embodiment of the present application;
fig. 4 is a schematic application scenario of a first terminal according to an embodiment of the present application.
The achievement of the objects, functional features and advantages of the present application will be further described with reference to the accompanying drawings, in conjunction with the embodiments.
Detailed Description
It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the application.
In the following description, suffixes such as "module", "component", or "unit" for representing elements are used only for facilitating the description of the present application, and have no specific meaning per se. Thus, "module," "component," or "unit" may be used in combination.
The terminal may be implemented in various forms. For example, terminals described in the present invention may include terminals such as cell phones, tablet computers, notebook computers, palm computers, personal digital assistants (Personal Digital Assistant, PDA), portable media players (Portable Media Player, PMP), navigation devices, wearable devices, smart bracelets, pedometers, and the like, as well as stationary terminals such as digital TVs, desktop computers, and the like.
The following description will be given by taking a terminal as an example, and those skilled in the art will understand that the configuration according to the embodiment of the present invention can be applied to a fixed type terminal in addition to elements particularly used for a moving purpose.
Referring to fig. 1, which is a schematic hardware structure of a terminal implementing various embodiments of the present invention, the terminal 100 may include: an RF (Radio Frequency) unit 101, a WiFi module 102, an audio output unit 103, an a/V (audio/video) input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109, a processor 110, and a power supply 111. It will be appreciated by those skilled in the art that the terminal structure shown in fig. 1 is not limiting of the terminal and that the terminal may include more or fewer components than shown, or may combine certain components, or a different arrangement of components.
The various components of terminal 100 are described in detail below in conjunction with fig. 1:
the radio frequency unit 101 may be used for receiving and transmitting signals during the information receiving or communication process, specifically, after receiving downlink information of the base station, processing the downlink information by the processor 110; and, the uplink data is transmitted to the base station. Typically, the radio frequency unit 101 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. In addition, the radio frequency unit 101 may also communicate with networks and other devices via wireless communications. The wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System of Mobile communication, global System for Mobile communications), GPRS (General Packet Radio Service ), CDMA2000 (Code Division Multiple Access, CDMA 2000), WCDMA (Wideband Code Division Multiple Access ), TD-SCDMA (Time Division-Synchronous Code Division Multiple Access, time Division synchronous code Division multiple Access), FDD-LTE (Frequency Division Duplexing-Long Term Evolution, frequency Division Duplex Long term evolution), and TDD-LTE (Time Division Duplexing-Long Term Evolution, time Division Duplex Long term evolution), etc.
WiFi belongs to a short-distance wireless transmission technology, and a terminal can help a user to send and receive e-mails, browse web pages, access streaming media and the like through the WiFi module 102, so that wireless broadband Internet access is provided for the user. Although fig. 1 shows a WiFi module 102, it is understood that it does not belong to the essential constitution of the terminal, and can be omitted entirely as required within the scope of not changing the essence of the invention.
The audio output unit 103 may convert audio data received by the radio frequency unit 101 or the WiFi module 102 or stored in the memory 109 into an audio signal and output as sound when the terminal 100 is in a call signal reception mode, a talk mode, a recording mode, a voice recognition mode, a broadcast reception mode, or the like. Also, the audio output unit 103 may also provide audio output (e.g., a call signal reception sound, a message reception sound, etc.) related to a specific function performed by the terminal 100. The audio output unit 103 may include a speaker, a buzzer, and the like.
The a/V input unit 104 is used to receive an audio or video signal. The a/V input unit 104 may include a graphics processor (Graphics Processing Unit, GPU) 1041 and a microphone 1042, the graphics processor 1041 processing image data of still pictures or video obtained by an image capturing device (e.g., a camera) in a video capturing mode or an image capturing mode. The processed image frames may be displayed on the display unit 106. The image frames processed by the graphics processor 1041 may be stored in the memory 109 (or other storage medium) or transmitted via the radio frequency unit 101 or the WiFi module 102. The microphone 1042 can receive sound (audio data) via the microphone 1042 in a phone call mode, a recording mode, a voice recognition mode, and the like, and can process such sound into audio data. The processed audio (voice) data may be converted into a format output that can be transmitted to the mobile communication base station via the radio frequency unit 101 in the case of a telephone call mode. The microphone 1042 may implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated in the course of receiving and transmitting the audio signal.
The terminal 100 further comprises at least one sensor 105, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor includes an ambient light sensor and a proximity sensor, wherein the ambient light sensor can adjust the brightness of the display panel 1061 according to the brightness of ambient light, and the proximity sensor can turn off the display panel 1061 and/or the backlight when the terminal 100 moves to the ear. As one of the motion sensors, the accelerometer sensor can detect the acceleration in all directions (generally three axes), and can detect the gravity and direction when stationary, and can be used for applications of recognizing the gesture of a mobile phone (such as horizontal and vertical screen switching, related games, magnetometer gesture calibration), vibration recognition related functions (such as pedometer and knocking), and the like; as for other sensors such as fingerprint sensors, pressure sensors, iris sensors, molecular sensors, gyroscopes, barometers, hygrometers, thermometers, infrared sensors, etc. that may also be configured in the mobile phone, the detailed description thereof will be omitted.
The display unit 106 is used to display information input by a user or information provided to the user. The display unit 106 may include a display panel 1061, and the display panel 1061 may be configured in the form of a liquid crystal display (Liquid Crystal Display, LCD), an Organic Light-Emitting Diode (OLED), or the like.
The user input unit 107 may be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the terminal. In particular, the user input unit 107 may include a touch panel 1071 and other input devices 1072. The touch panel 1071, also referred to as a touch screen, may collect touch operations thereon or thereabout by a user (e.g., operations of the user on the touch panel 1071 or thereabout by using any suitable object or accessory such as a finger, a stylus, etc.) and drive the corresponding connection device according to a predetermined program. The touch panel 1071 may include two parts of a touch detection device and a touch controller. The touch detection device detects the touch azimuth of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device, converts it into touch point coordinates, and sends the touch point coordinates to the processor 110, and can receive and execute commands sent from the processor 110. Further, the touch panel 1071 may be implemented in various types such as resistive, capacitive, infrared, and surface acoustic wave. The user input unit 107 may include other input devices 1072 in addition to the touch panel 1071. In particular, other input devices 1072 may include, but are not limited to, one or more of a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, mouse, joystick, etc., as specifically not limited herein.
Further, the touch panel 1071 may overlay the display panel 1061, and when the touch panel 1071 detects a touch operation thereon or thereabout, the touch panel 1071 is transferred to the processor 110 to determine the type of touch event, and then the processor 110 provides a corresponding visual output on the display panel 1061 according to the type of touch event. Although in fig. 1, the touch panel 1071 and the display panel 1061 are two independent components for implementing the input and output functions of the terminal, in some embodiments, the touch panel 1071 may be integrated with the display panel 1061 to implement the input and output functions of the terminal, which is not limited herein.
The interface unit 108 serves as an interface through which at least one external device is connected to the terminal 100. For example, the external devices may include a wired or wireless headset port, an external power (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 108 may be used to receive input (e.g., data information, power, etc.) from an external device and transmit the received input to one or more elements within the terminal 100 or may be used to transmit data between the terminal 100 and an external device.
Memory 109 may be used to store software programs as well as various data. The memory 109 may mainly include a storage program area that may store an operating system, application programs required for at least one function (such as a sound playing function, an image playing function, etc.), and a storage data area; the storage data area may store data (such as audio data, phonebook, etc.) created according to the use of the handset, etc. In addition, memory 109 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid-state storage device.
The processor 110 is a control center of the terminal, and connects various parts of the entire terminal using various interfaces and lines, and performs various functions of the terminal and processes data by running or executing software programs and/or modules stored in the memory 109 and calling data stored in the memory 109, thereby performing overall monitoring of the terminal. Processor 110 may include one or more processing units; preferably, the processor 110 may integrate an application processor that primarily handles operating systems, user interfaces, applications, etc., with a modem processor that primarily handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 110.
The terminal 100 may further include a power source 111 (e.g., a battery) for supplying power to the respective components, and preferably, the power source 111 may be logically connected to the processor 110 through a power management system, so as to perform functions of managing charging, discharging, and power consumption management through the power management system.
Although not shown in fig. 1, the terminal 100 may further include a bluetooth module or the like, which is not described herein.
In order to facilitate understanding of the embodiments of the present invention, a communication network system on which the terminal of the present invention is based will be described below.
Referring to fig. 2, fig. 2 is a schematic diagram of a communication network system according to an embodiment of the present invention, where the communication network system is an LTE system of a general mobile communication technology, and the LTE system includes a UE (User Equipment) 201, an e-UTRAN (Evolved UMTS Terrestrial Radio Access Network ) 202, an epc (Evolved Packet Core, evolved packet core) 203, and an IP service 204 of an operator that are sequentially connected in communication.
Specifically, the UE201 may be the terminal 100 described above, and will not be described herein.
The E-UTRAN202 includes eNodeB2021 and other eNodeB2022, etc. The eNodeB2021 may be connected with other eNodeB2022 by a backhaul (e.g., an X2 interface), the eNodeB2021 is connected to the EPC203, and the eNodeB2021 may provide access from the UE201 to the EPC 203.
EPC203 may include MME (Mobility Management Entity ) 2031, hss (Home Subscriber Server, home subscriber server) 2032, other MMEs 2033, SGW (Serving Gate Way) 2034, pgw (PDN Gate Way) 2035 and PCRF (Policy and Charging Rules Function, policy and tariff function entity) 2036, and so on. The MME2031 is a control node that handles signaling between the UE201 and EPC203, providing bearer and connection management. HSS2032 is used to provide registers to manage functions such as home location registers (not shown) and to hold user specific information about service characteristics, data rates, etc. All user data may be sent through SGW2034 and PGW2035 may provide IP address allocation and other functions for UE201, PCRF2036 is a policy and charging control policy decision point for traffic data flows and IP bearer resources, which selects and provides available policy and charging control decisions for a policy and charging enforcement function (not shown).
IP services 204 may include the internet, intranets, IMS (IP Multimedia Subsystem ), or other IP services, etc.
Although the LTE system is described above as an example, it should be understood by those skilled in the art that the present application is not limited to LTE systems, but may be applied to other wireless communication systems, such as GSM, CDMA2000, WCDMA, TD-SCDMA, and future new network systems.
Based on the above terminal hardware structure and the communication network system, various embodiments of the method of the present application are provided.
Fig. 3 is a flowchart of an embodiment of an audio processing method provided by the present application. Once triggered by the user, the process in this embodiment automatically operates through the first terminal, where each step may be performed sequentially as shown in the flowchart, or may be performed simultaneously according to multiple steps in actual situations, which is not limited herein. The first terminal includes an audio processing module. The audio processing method provided by the application comprises the following steps:
step S310, the first terminal acquires audio information, and performs preset processing on the audio information through the audio processing module to obtain target audio information;
step S330, the first terminal sends the target audio information to a second terminal, so that the second terminal performs a preset operation on the target audio information; and the second terminal and the first terminal establish communication connection through a preset communication protocol.
According to the embodiment, the first terminal is used for acquiring the audio information, the audio processing module of the first terminal is used for carrying out preset processing on the audio information to obtain the target audio information, so that the target audio information is sent to the second terminal which is in communication connection with the first terminal through the preset communication protocol, the audio processing module of the first terminal can be used for carrying out preset processing on the audio information to enable the target audio information obtained through the preset processing to be better in effect, and the second terminal can be used for acquiring the target audio information and carrying out preset operation on the target audio information, so that the use experience of a user is improved.
The above steps will be specifically described with reference to specific embodiments.
In step S310, the first terminal acquires audio information, and performs preset processing on the audio information through the audio processing module to obtain target audio information.
In this embodiment, the first terminal may be a computer. The first terminal may include an audio processing module.
In this embodiment, the first terminal may be communicatively connected to an audio device, where the audio device may include a microphone, and the audio processing method further includes:
Step S3101, the first terminal judges whether to establish screen projection connection with a second terminal;
step S3102, if the second terminal has established a screen-projection connection with the first terminal, the first terminal controls the microphone to collect the audio information.
In this embodiment, the screen projection is to synchronously display data displayed on one terminal screen on another terminal or terminals so that the data displayed on the terminal screens are identical, as the name implies. Therefore, after the screen throwing connection is successful, the terminal can acquire local screen data so as to realize screen throwing after the screen data are transmitted.
In this embodiment, the second terminal may be a mobile phone. The second terminal may establish communication connection with the first terminal through a preset communication protocol, where the preset communication protocol may be a Socket protocol. Specifically, the second terminal can be connected with the first terminal through a code scanning or USB interface in a screen projection manner. For example, the second terminal may scan the two-dimensional code of the first terminal through the camera, obtain first identity identification information of the first terminal included in the two-dimensional code, so as to establish screen-casting connection with the first terminal based on the first identity identification information, and send screen-casting information to the first terminal, so that the first terminal receives and synchronously plays the screen-casting information. The screen shot information may include screen shot audio information and screen shot image information.
In this embodiment, after the second terminal initiates the request for establishing the screen-projection connection with the first terminal, if the screen-projection connection is successful, the second terminal and the first terminal generally receive a notification that the screen-projection connection is successful. And after the second terminal initiates a request for establishing screen connection with the first terminal, if a screen connection success message is not received in a preset time, judging that the screen connection is unsuccessful.
In this embodiment, after the second terminal and the first terminal establish communication connection through Socket protocol, the first terminal determines whether to establish screen-throwing connection with the second terminal, and if the second terminal has established screen-throwing connection with the first terminal, that is, the screen-throwing connection is successful, the first terminal controls an audio device in communication connection with the first terminal to start the microphone to collect the audio information. And if the second terminal does not establish screen projection connection with the first terminal, the first terminal does not control the microphone to collect the audio information. The audio information collected by the microphone may refer to sound information output by a user.
In this embodiment, the step of obtaining the audio information by the first terminal includes:
In step S3103, the audio device sends the collected audio information to the first terminal, where the audio information and the target audio information are both audio data in a pulse code modulation format.
Specifically, when the second terminal and the first terminal establish a screen-throwing connection, the first terminal controls the microphone to collect the audio information, and the audio equipment sends the audio information collected by the microphone to the first terminal, that is, the first terminal (computer) can obtain the audio information through the microphone of the external audio equipment, and the audio information collected by the microphone of the external audio equipment is more lifelike, so that the audio information obtained by the first terminal is more lifelike. The audio information is audio data in a pulse code modulation format, namely a PCM code stream.
In this embodiment, the step of performing the preset processing on the audio information includes:
step S3104, sequentially performing sound-modifying processing and noise-reducing processing on the audio information based on a preset algorithm.
In this embodiment, the audio processing module may sequentially perform the sound-changing process and the noise-reducing process on the audio information based on a preset algorithm. The may refer to a mature sound processing algorithm of the first terminal (computer). The audio processing module carries out sound changing processing on the audio information based on the mature sound processing algorithm of the computer side, so that more personalized voice can be obtained, and meanwhile, noise in the sound effect information can be removed through noise reduction processing.
In this embodiment, the step of sequentially performing the sound modification processing on the audio information includes:
changing sound attributes in the audio information; and/or
Changing the dialect type in the audio information; and/or
Changing languages in the audio information; and/or
Changing the current background sound of the audio information;
wherein the sound attribute includes at least one of a tone and a timbre.
In particular, the tone may refer to the height of the sound, which is determined by the frequency of the sound. Changing the pitch of the sound in the audio information may mean turning the pitch of the sound in the audio information high or turning the pitch of the sound in the audio information low, and the sound of the target audio information may be changed by changing the pitch of the sound in the audio information using the audio processing module. The tone color may refer to characteristics of different sound frequencies represented in terms of waveforms, which are determined by how much of their harmonics are and the relative intensities of the harmonics. The obtained target audio information is more interesting by changing the tone color of the sound in the audio information.
In this embodiment, a plurality of dialect types may be set in the first terminal in advance, and by changing the dialect type in the audio information, the sound of the target audio information may be changed. The audio processing module can adopt the existing conversion method to realize the change of the dialect type, the invention does not limit the conversion method, but in the process of changing the dialect type in the audio information, the basic meaning of the audio information needs to be controlled unchanged.
In this embodiment, a plurality of languages may be set in advance in the first terminal, and the sound of the target audio information may be changed by changing the languages in the audio information. The audio processing module can adopt the existing translation method to realize the change of languages, the invention does not limit the translation method, but in the process of changing the languages in the audio information, the basic meaning of the audio information needs to be controlled to be unchanged.
In this embodiment, the specific method for changing the current background sound of the audio information may include: adding new background sounds to the audio information and/or changing the current background sounds of the audio information. A new background sound may be set in advance on the first terminal side. The audio information may be enriched by changing the current background sound of the audio information.
In step S330, the first terminal sends the target audio information to a second terminal, so that the second terminal performs a preset operation on the target audio information.
In this embodiment, the first terminal may acquire the second identification information of the second terminal, so as to send the target audio information to the second terminal based on the second identification information, so that the second terminal performs a preset operation on the target audio information.
In this embodiment, the step of performing, by the second terminal, a preset operation on the target audio information includes:
in step S3301, the second terminal sends the target audio information to a preset application.
In this embodiment, the preset application may include a game application and a live room. When the second terminal and the first terminal are connected in a screen throwing manner, the first terminal can control the microphone to collect the audio information and acquire the audio information, the audio processing module sequentially carries out sound changing processing and noise reduction processing on the audio information to obtain target audio information, and then the first terminal sends the target audio information to the second terminal so that the second terminal can send the target audio information to a preset application, and the preset application can play the target audio information after receiving the target audio information. The second terminal may send the target audio information to the game application or the live broadcast room, or may send the target audio information to the game application before sending the target audio information to the live broadcast room.
For example, when the mobile phone and the computer are connected in a screen throwing manner, the computer can control the external audio equipment to start the microphone to acquire the audio information, the computer can acquire the audio information, and control the audio processing module to sequentially perform sound changing processing and noise reduction processing on the audio information to acquire target audio information, then the computer sends the target audio information to the mobile phone, and the mobile phone can send the target audio information to the living broadcasting room to play, so that the sound effect of the target audio information acquired through the microphone of the external audio equipment of the computer and the mature sound processing algorithm of the computer side is better than that of the mobile phone by utilizing the microphone and the sound processing algorithm of the mobile phone, the noise is less, and the sound experience of a user is improved. Meanwhile, the sound of the target audio information obtained by carrying out sound changing processing on the audio information is more interesting.
According to the embodiment, the first terminal acquires the audio information, and the audio processing module of the first terminal performs preset processing on the audio information to obtain target audio information; the first terminal sends the target audio information to a second terminal so that the second terminal can perform preset operation on the target audio information; the second terminal and the first terminal establish communication connection through a preset communication protocol, so that the audio information can be subjected to preset processing by using an audio processing module of the first terminal, the target audio information obtained through the preset processing has better effect, and the second terminal can acquire the target audio information and perform preset operation on the target audio information; further, the first terminal is in communication connection with an audio device, the audio device comprises a microphone, and the first terminal can judge whether to establish screen projection connection with the second terminal; if the second terminal and the first terminal are connected in a screen throwing way, the first terminal controls the microphone to collect the audio information, so that the microphone of the external audio equipment collects the audio information, the effect optimization of the collected audio information is realized, and the use experience of a user is improved.
Fig. 4 is a schematic application scenario of the first terminal 200 according to an embodiment of the present application. The first terminal 200 establishes a communication connection with the second terminal 210 through a preset communication protocol. The first terminal 200 includes an audio processing module, and the first terminal 200 may further include a memory 109 and a processor 110. The memory 109 is connected to the processor 110. The memory 109 contains control instructions which, when read by the processor 110, control the first terminal 200 to implement the steps of:
the first terminal 200 obtains audio information, and performs preset processing on the audio information through the audio processing module to obtain target audio information;
the first terminal 200 sends the target audio information to the second terminal 210, so that the second terminal 210 performs a preset operation on the target audio information;
wherein the second terminal establishes communication connection with the first terminal 200 through a preset communication protocol.
Optionally, the standard dual camera includes a first camera and a second camera, the first terminal 200 is communicatively connected to an audio device, the audio device includes a microphone, and the audio processing method further includes:
the first terminal 200 determines whether to establish a screen-throwing connection with the second terminal 210;
If the second terminal 210 has established a screen connection with the first terminal 200, the first terminal 200 controls the microphone to collect the audio information.
Optionally, the step of obtaining audio information by the first terminal 200 includes:
the audio device sends the collected audio information to the first terminal 200, where the audio information and the target audio information are both audio data in a pulse code modulation format.
Optionally, the method further comprises:
the second terminal 210 acquires first identity information of the first terminal 200, so as to send screen projection information to the first terminal 200 based on the first identity information;
the first terminal 200 obtains second identification information of the second terminal 210 to send the target audio information to the second terminal 210 based on the second identification information.
Optionally, the preset communication protocol is a Socket protocol, and the first terminal 200 receives and synchronously plays the screen-throwing information.
Optionally, the step of performing preset processing on the audio information includes:
and sequentially performing sound changing processing and noise reduction processing on the audio information based on a preset algorithm.
Optionally, the step of sequentially performing the sound modification processing on the audio information includes:
changing sound attributes in the audio information; and/or
Changing the dialect type in the audio information; and/or
Changing languages in the audio information; and/or
Changing the current background sound of the audio information;
wherein the sound attribute includes at least one of a tone and a timbre.
Optionally, the step of performing a preset operation on the target audio information by the second terminal 210 includes:
the second terminal 210 transmits the target audio information to a preset application.
Acquiring audio information through the first terminal 200, and performing preset processing on the audio information through an audio processing module of the first terminal 200 to obtain target audio information; the first terminal 200 sends the target audio information to the second terminal 210, so that the second terminal 210 performs a preset operation on the target audio information; the second terminal 210 establishes communication connection with the first terminal 200 through a preset communication protocol, so that the audio processing module of the first terminal 200 can be utilized to perform preset processing on the audio information, so that the target audio information obtained through the preset processing has a better effect, and the second terminal 210 can obtain the target audio information and perform preset operation on the target audio information; further, the first terminal 200 is communicatively connected to an audio device, where the audio device includes a microphone, and the first terminal 200 may determine whether to establish a screen-projection connection with the second terminal 210; if the second terminal 210 has established a screen-throwing connection with the first terminal 200, the first terminal 200 controls the microphone to collect the audio information, so that the microphone of the external audio device collects the audio information, thereby realizing the effect optimization of the collected audio information and improving the use experience of the user.
The embodiment of the present application also provides a computer-readable storage medium having one or more programs executed by one or more processors to implement the steps of:
the method comprises the steps that a first terminal obtains audio information, and preset processing is conducted on the audio information through an audio processing module of the first terminal to obtain target audio information;
the first terminal sends the target audio information to a second terminal so that the second terminal can perform preset operation on the target audio information;
and the second terminal and the first terminal establish communication connection through a preset communication protocol.
Optionally, the standard dual camera includes a first camera and a second camera, the first terminal is in communication connection with an audio device, the audio device includes a microphone, and the audio processing method further includes:
the first terminal judges whether to establish screen projection connection with the second terminal;
and if the second terminal has established screen projection connection with the first terminal, the first terminal controls the microphone to acquire the audio information.
Optionally, the step of acquiring the audio information by the first terminal includes:
The audio device sends the collected audio information to the first terminal, wherein the audio information and the target audio information are both audio data in a pulse code modulation format.
Optionally, the method further comprises:
the second terminal acquires first identity identification information of the first terminal so as to send screen projection information to the first terminal based on the first identity identification information;
the first terminal obtains second identification information of the second terminal so as to send the target audio information to the second terminal based on the second identification information.
Optionally, the preset communication protocol is a Socket protocol, and the first terminal receives and synchronously plays the screen-throwing information.
Optionally, the step of performing preset processing on the audio information includes:
and sequentially performing sound changing processing and noise reduction processing on the audio information based on a preset algorithm.
Optionally, the step of sequentially performing the sound modification processing on the audio information includes:
changing sound attributes in the audio information; and/or
Changing the dialect type in the audio information; and/or
Changing languages in the audio information; and/or
Changing the current background sound of the audio information;
wherein the sound attribute includes at least one of a tone and a timbre.
Optionally, the step of performing a preset operation on the target audio information by the second terminal includes:
and the second terminal sends the target audio information to a preset application.
Acquiring audio information through the first terminal through the computer readable storage medium, and performing preset processing on the audio information through an audio processing module of the first terminal to obtain target audio information; the first terminal sends the target audio information to a second terminal so that the second terminal can perform preset operation on the target audio information; the second terminal and the first terminal establish communication connection through a preset communication protocol, so that the audio information can be subjected to preset processing by using an audio processing module of the first terminal, the target audio information obtained through the preset processing has better effect, and the second terminal can acquire the target audio information and perform preset operation on the target audio information; further, the first terminal is in communication connection with an audio device, the audio device comprises a microphone, and the first terminal can judge whether to establish screen projection connection with the second terminal; if the second terminal and the first terminal are connected in a screen throwing way, the first terminal controls the microphone to collect the audio information, so that the microphone of the external audio equipment collects the audio information, the effect optimization of the collected audio information is realized, and the use experience of a user is improved.
The embodiment of the application also provides a computer readable storage medium. The computer-readable storage medium here stores one or more programs. Wherein the computer readable storage medium may include volatile memory, such as random access memory; the memory may also include non-volatile memory, such as read-only memory, flash memory, hard disk, or solid state disk; the memory may also comprise a combination of the above types of memories.
The corresponding technical features in the above embodiments can be used mutually without causing contradiction between schemes or incapacitation.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The foregoing embodiment numbers of the present application are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.
From the above description of the embodiments, it will be clear to those skilled in the art that the above-described embodiment method may be implemented by means of software plus a necessary general hardware platform, but of course may also be implemented by means of hardware, but in many cases the former is a preferred embodiment. Based on such understanding, the technical solution of the present application may be embodied essentially or in a part contributing to the prior art in the form of a software product stored in a storage medium (e.g. ROM/RAM, magnetic disk, optical disk) comprising instructions for causing a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the method according to the embodiments of the present application.
The embodiments of the present application have been described above with reference to the accompanying drawings, but the present application is not limited to the above-described embodiments, which are merely illustrative and not restrictive, and many forms may be made by those having ordinary skill in the art without departing from the spirit of the present application and the scope of the claims, which are to be protected by the present application.

Claims (10)

1. An audio processing method applied to a first terminal, wherein the first terminal comprises an audio processing module, and the audio processing method is characterized in that the first terminal is in communication connection with an audio device, and comprises the following steps:
the first terminal judges whether screen throwing connection is established with a second terminal or not;
if the second terminal has established screen-throwing connection with the first terminal, the first terminal controls the audio equipment to acquire audio information, and preset processing is carried out on the audio information through the audio processing module to obtain target audio information;
the first terminal sends the target audio information to a second terminal so that the second terminal performs preset operation on the target audio information, and the second terminal and the first terminal establish communication connection through a preset communication protocol;
and if the second terminal does not establish screen projection connection with the first terminal, the first terminal does not control the audio equipment to acquire the audio information.
2. The audio processing method of claim 1, wherein the audio device comprises a microphone.
3. The audio processing method of claim 2, wherein the step of the first terminal acquiring audio information comprises:
The audio device sends the collected audio information to the first terminal, wherein the audio information and the target audio information are both audio data in a pulse code modulation format.
4. The audio processing method according to claim 2, wherein the audio processing method further comprises:
the second terminal acquires first identity identification information of the first terminal so as to send screen projection information to the first terminal based on the first identity identification information;
the first terminal obtains second identification information of the second terminal so as to send the target audio information to the second terminal based on the second identification information.
5. The audio processing method according to claim 4, wherein the preset communication protocol is a Socket protocol, and the first terminal receives and synchronously plays the screen-throwing information.
6. The audio processing method according to claim 1, wherein the step of performing a preset process on the audio information includes:
and sequentially performing sound changing processing and noise reduction processing on the audio information based on a preset algorithm.
7. The audio processing method of claim 6, wherein the step of sequentially performing the sound varying process on the audio information comprises:
Changing sound attributes in the audio information; and/or
Changing the dialect type in the audio information; and/or
Changing languages in the audio information; and/or
Changing the current background sound of the audio information;
wherein the sound attribute includes at least one of a tone and a timbre.
8. The audio processing method as claimed in claim 1, wherein the step of performing a preset operation on the target audio information by the second terminal includes:
and the second terminal sends the target audio information to a preset application.
9. A terminal, the terminal comprising:
a processor; and
A memory connected to the processor, the memory containing control instructions which, when read by the processor, control the terminal to implement the audio processing method of any one of claims 1-8.
10. A computer readable storage medium having one or more programs for execution by one or more processors to implement the audio processing method of any of claims 1-8.
CN202011373395.1A 2020-11-30 2020-11-30 Audio processing method, terminal and computer readable storage medium Active CN112562639B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011373395.1A CN112562639B (en) 2020-11-30 2020-11-30 Audio processing method, terminal and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011373395.1A CN112562639B (en) 2020-11-30 2020-11-30 Audio processing method, terminal and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN112562639A CN112562639A (en) 2021-03-26
CN112562639B true CN112562639B (en) 2023-09-19

Family

ID=75045387

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011373395.1A Active CN112562639B (en) 2020-11-30 2020-11-30 Audio processing method, terminal and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN112562639B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113223542B (en) * 2021-04-26 2024-04-12 北京搜狗科技发展有限公司 Audio conversion method and device, storage medium and electronic equipment

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103702238A (en) * 2013-12-23 2014-04-02 华为终端有限公司 Multi-screen video sharing method, terminals and server
CN103856390A (en) * 2012-12-04 2014-06-11 腾讯科技(深圳)有限公司 Instant messaging method and system, messaging information processing method and terminals
CN104618793A (en) * 2015-02-28 2015-05-13 联想(北京)有限公司 Information processing method and electronic equipment
CN107517399A (en) * 2016-06-16 2017-12-26 腾讯科技(深圳)有限公司 A kind of synchronous method of media information and server
WO2018040813A1 (en) * 2016-08-31 2018-03-08 腾讯科技(深圳)有限公司 Audio/video communication method, terminal, server, and storage medium
CN109065065A (en) * 2018-09-27 2018-12-21 南昌努比亚技术有限公司 Call method, mobile terminal and computer readable storage medium
CN110225195A (en) * 2019-05-30 2019-09-10 维沃移动通信有限公司 A kind of audio communication method and terminal
CN110418189A (en) * 2019-08-02 2019-11-05 钟国波 A kind of low latency can be used for transmitting game, high frame per second audio/video transmission method
CN110505496A (en) * 2018-05-16 2019-11-26 腾讯科技(深圳)有限公司 Live-broadcast control method and device, storage medium and electronic device
CN209859129U (en) * 2019-06-12 2019-12-27 深圳市美嘉光电科技有限公司 Live broadcast sound card with time management function
CN110956951A (en) * 2019-12-23 2020-04-03 苏州思必驰信息科技有限公司 Voice enhancement acquisition accessory, method, system, equipment and storage medium
CN111212032A (en) * 2019-12-13 2020-05-29 视联动力信息技术股份有限公司 Audio processing method and device based on video network, electronic equipment and storage medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10757755B2 (en) * 2016-06-16 2020-08-25 Huawei Technologies Co., Ltd. Method and apparatus for communication between wearable device and mobile terminal
KR102471499B1 (en) * 2016-07-05 2022-11-28 삼성전자주식회사 Image Processing Apparatus and Driving Method Thereof, and Computer Readable Recording Medium

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103856390A (en) * 2012-12-04 2014-06-11 腾讯科技(深圳)有限公司 Instant messaging method and system, messaging information processing method and terminals
CN103702238A (en) * 2013-12-23 2014-04-02 华为终端有限公司 Multi-screen video sharing method, terminals and server
CN104618793A (en) * 2015-02-28 2015-05-13 联想(北京)有限公司 Information processing method and electronic equipment
CN107517399A (en) * 2016-06-16 2017-12-26 腾讯科技(深圳)有限公司 A kind of synchronous method of media information and server
WO2018040813A1 (en) * 2016-08-31 2018-03-08 腾讯科技(深圳)有限公司 Audio/video communication method, terminal, server, and storage medium
CN110505496A (en) * 2018-05-16 2019-11-26 腾讯科技(深圳)有限公司 Live-broadcast control method and device, storage medium and electronic device
CN109065065A (en) * 2018-09-27 2018-12-21 南昌努比亚技术有限公司 Call method, mobile terminal and computer readable storage medium
CN110225195A (en) * 2019-05-30 2019-09-10 维沃移动通信有限公司 A kind of audio communication method and terminal
CN209859129U (en) * 2019-06-12 2019-12-27 深圳市美嘉光电科技有限公司 Live broadcast sound card with time management function
CN110418189A (en) * 2019-08-02 2019-11-05 钟国波 A kind of low latency can be used for transmitting game, high frame per second audio/video transmission method
CN111212032A (en) * 2019-12-13 2020-05-29 视联动力信息技术股份有限公司 Audio processing method and device based on video network, electronic equipment and storage medium
CN110956951A (en) * 2019-12-23 2020-04-03 苏州思必驰信息科技有限公司 Voice enhancement acquisition accessory, method, system, equipment and storage medium

Also Published As

Publication number Publication date
CN112562639A (en) 2021-03-26

Similar Documents

Publication Publication Date Title
CN112004173B (en) Bluetooth headset switching method, mobile terminal and computer readable storage medium
CN111708503B (en) Screen projection control method, device and computer readable storage medium
CN109195143B (en) Network access method, mobile terminal and readable storage medium
CN112394896B (en) Reverse screen-throwing frame rate control method, device and computer readable storage medium
CN112689191B (en) Screen projection control method, terminal and computer readable storage medium
CN111479309B (en) Service processing method, user equipment and computer readable storage medium
CN110187808B (en) Dynamic wallpaper setting method and device and computer-readable storage medium
CN109088997B (en) Game audio control method, terminal and computer readable storage medium
CN112433690B (en) Data processing method, terminal and computer readable storage medium
CN108834157A (en) Internet wide band distribution, mobile terminal and computer readable storage medium
CN112600815B (en) Video display method, terminal and computer readable storage medium
CN110052024B (en) Game vibration adjusting method, adjusting device, mobile terminal and storage medium
CN112492340B (en) Live broadcast audio acquisition method, mobile terminal and computer readable storage medium
CN112562639B (en) Audio processing method, terminal and computer readable storage medium
CN109889665B (en) Volume adjusting method, mobile terminal and storage medium
CN110392163B (en) Screen-off control method, mobile terminal and computer-readable storage medium
CN109462829B (en) Call transfer method, device and computer readable storage medium
CN112437472A (en) Network switching method, equipment and computer readable storage medium
CN108805559B (en) Two-dimensional code pushing method, mobile terminal and computer readable storage medium
CN112887776B (en) Method, equipment and computer readable storage medium for reducing audio delay
CN111970668B (en) Bluetooth audio control method, device and computer readable storage medium
CN110262707B (en) Application program operation recording method and device and computer readable storage medium
CN110286959B (en) Game list management method, mobile terminal and computer readable storage medium
CN110266991B (en) Video call method, mobile terminal and computer readable storage medium
CN114756187A (en) Screen-casting video image processing method and equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant