CN113225615A - Television program playing method, terminal equipment, server and storage medium - Google Patents
Television program playing method, terminal equipment, server and storage medium Download PDFInfo
- Publication number
- CN113225615A CN113225615A CN202110427756.4A CN202110427756A CN113225615A CN 113225615 A CN113225615 A CN 113225615A CN 202110427756 A CN202110427756 A CN 202110427756A CN 113225615 A CN113225615 A CN 113225615A
- Authority
- CN
- China
- Prior art keywords
- television program
- description information
- key frame
- server
- playing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 64
- 102100037812 Medium-wave-sensitive opsin 1 Human genes 0.000 claims description 25
- 238000004891 communication Methods 0.000 description 12
- 238000012545 processing Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 8
- 230000005540 biological transmission Effects 0.000 description 7
- 230000002093 peripheral effect Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 206010047571 Visual impairment Diseases 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000001771 impaired effect Effects 0.000 description 2
- 208000029257 vision disease Diseases 0.000 description 2
- 230000004393 visual impairment Effects 0.000 description 2
- 241000251730 Chondrichthyes Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440236—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/47202—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/482—End-user interface for program selection
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4888—Data services, e.g. news ticker for displaying teletext characters
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Databases & Information Systems (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Abstract
The invention discloses a television program playing method used for terminal equipment, which comprises the following steps: when a target television program is received, acquiring a key frame of the television program; sending the key frame to a server so that the server can acquire data of the key frame to obtain text description information; when receiving the character description information sent by the server, obtaining reading audio based on the character description information; and playing the reading audio. The invention also discloses a terminal device, a server and a computer readable storage medium. By using the television program playing method, technical personnel do not need to record description audio manually, so that the labor cost is saved, and the user experience is better.
Description
Technical Field
The present invention relates to the field of television program processing, and in particular, to a television program playing method, a terminal device, a server, and a computer-readable storage medium.
Background
At present, two television program playing methods are disclosed in order to enable people with visual impairment to watch television programs; firstly, adding some voice-over in the original audio in the television program to describe the current scene; second, the normal audio and the voice of the voice-over are divided, and the user can select whether the voice-over is required.
In the two television program playing methods, technicians perform voice description on pictures in the television programs, and store the voice description in the television programs in an external voice mode so as to enable people with visual impairment to watch the television programs.
However, the existing television program playing method consumes a large amount of labor cost.
Disclosure of Invention
The invention mainly aims to provide a television program playing method, terminal equipment, a server and a computer readable storage medium, and aims to solve the technical problem that the labor cost is high in the prior art by using the existing television program playing method.
In order to achieve the above object, the present invention provides a method for playing a television program, which is used for a terminal device, and the method comprises the following steps:
when a target television program is received, acquiring a key frame of the television program;
sending the key frame to a server so that the server can acquire data of the key frame to obtain text description information;
when receiving the character description information sent by the server, obtaining reading audio based on the character description information;
and playing the reading audio.
Optionally, before the step of sending the key frame to the server, the method further includes:
acquiring current time corresponding to the key frame;
the step of sending the key frame to a server includes:
and sending the key frame and the current time to a server so that the server acquires data of the key frame to obtain picture description information, and marking the picture description information by using the current time to obtain text description information.
Optionally, the target television program is a television program composed of GOP sequences; the step of acquiring the key frame of the television program when receiving the target television program comprises the following steps:
when a target television program is received, acquiring the GOP sequence in the target television program;
determining an I-frame in the GOP sequence as the key frame.
Optionally, when receiving the text description information sent by the server, the step of obtaining a reading audio based on the text description information includes:
when the character description information is received, decoding the character description information to obtain natural language information;
and when the current time in the text description information is reached, reading the natural language information to obtain the reading audio.
In addition, in order to achieve the above object, the present invention further provides a television program playing method, which is used for a server, and the method includes the following steps:
receiving a key frame sent by terminal equipment, wherein the key frame is acquired in a television program when a target program is received;
acquiring data of the key frame to obtain text description information;
and sending the text description information to the terminal equipment so that the terminal equipment obtains a reading audio based on the text description information and plays the reading audio.
Optionally, before the step of acquiring data of the key frame and obtaining the text description information, the method further includes:
receiving the current time sent by the terminal equipment, wherein the current time corresponds to the key frame;
the step of acquiring data of the key frame to obtain the text description information comprises the following steps:
acquiring data of the key frame to obtain picture description information;
and marking the picture description information by using the current time to obtain the text description information.
Optionally, the textual description information includes the current time, a name of a thing in the key frame, location information of the thing, and a feature of the thing.
In addition, to achieve the above object, the present invention further provides a terminal device, including: the system comprises a memory, a processor and a television program playing program stored on the memory and running on the processor, wherein the television program playing program realizes the steps of the television program playing method according to any one of the above items when being executed by the processor.
In addition, to achieve the above object, the present invention further provides a server, including: the system comprises a memory, a processor and a television program playing program stored on the memory and running on the processor, wherein the television program playing program realizes the steps of the television program playing method according to any one of the above items when being executed by the processor.
In addition, to achieve the above object, the present invention further provides a computer readable storage medium, having a television program playing program stored thereon, where the television program playing program, when executed by a processor, implements the steps of the television program playing method according to any one of the above items.
The technical scheme of the invention provides a television program playing method which is used for terminal equipment and comprises the following steps: when a target television program is received, acquiring a key frame of the television program; sending the key frame to a server so that the server can acquire data of the key frame to obtain text description information; when receiving the character description information sent by the server, obtaining reading audio based on the character description information; and playing the reading audio.
In the existing television program playing method, technicians are required to manually record description audio of pictures to obtain voice-over, and the voice-over is added into the television program, so that more labor cost is consumed. In the invention, the server acquires the data of the key frame to obtain the text description information, and meanwhile, the terminal equipment obtains the reading audio based on the text description information without manually recording the description audio by technicians, so that the labor cost is saved, and the user experience is better.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the structures shown in the drawings without creative efforts.
Fig. 1 is a schematic structural diagram of a terminal device in a hardware operating environment according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of a television program playing method according to a first embodiment of the present invention;
FIG. 3 is a diagram of an exemplary key frame of the present invention;
fig. 4 is a flowchart illustrating a television program playing method according to a second embodiment of the present invention;
fig. 5 is a block diagram of a first embodiment of a tv program playing apparatus according to the present invention;
fig. 6 is a block diagram of a television program playback apparatus according to a second embodiment of the present invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, fig. 1 is a schematic structural diagram of a terminal device in a hardware operating environment according to an embodiment of the present invention.
The terminal device may be a User Equipment (UE) such as a Mobile phone, a smart phone, a laptop, a digital broadcast receiver, a Personal Digital Assistant (PDA), a tablet computer (PAD), a handheld device, a vehicle mounted device, a wearable device, a computing device or other processing device connected to a wireless modem, a Mobile Station (MS), etc. The terminal device may be referred to as a user terminal, a portable terminal, a desktop terminal, etc.
In general, a terminal device includes: at least one processor 301, a memory 302, and a television program playback program stored on said memory and executable on said processor, said television program playback program being configured to implement the steps of the television program playback method as described above.
The processor 301 may include one or more processing cores, such as a 4-core processor, an 8-core processor, and so on. The processor 301 may be implemented in at least one hardware form of a DSP (Digital Signal Processing), an FPGA (Field-Programmable Gate Array), and a PLA (Programmable Logic Array). The processor 301 may also include a main processor and a coprocessor, where the main processor is a processor for processing data in an awake state, and is also called a Central Processing Unit (CPU); a coprocessor is a low power processor for processing data in a standby state. In some embodiments, the processor 301 may be integrated with a GPU (Graphics Processing Unit), which is responsible for rendering and drawing the content required to be displayed on the display screen. The processor 301 may further include an AI (Artificial Intelligence) processor for processing relevant tv program broadcasting method operations, so that the tv program broadcasting method model can be trained and learned autonomously, thereby improving efficiency and accuracy.
In some embodiments, the terminal may further include: a communication interface 303 and at least one peripheral device. The processor 301, the memory 302 and the communication interface 303 may be connected by a bus or signal lines. Various peripheral devices may be connected to communication interface 303 via a bus, signal line, or circuit board. Specifically, the peripheral device includes: at least one of radio frequency circuitry 304, a display screen 305, and a power source 306.
The communication interface 303 may be used to connect at least one peripheral device related to I/O (Input/Output) to the processor 301 and the memory 302. In some embodiments, processor 301, memory 302, and communication interface 303 are integrated on the same chip or circuit board; in some other embodiments, any one or two of the processor 301, the memory 302 and the communication interface 303 may be implemented on a single chip or circuit board, which is not limited in this embodiment.
The Radio Frequency circuit 304 is used for receiving and transmitting RF (Radio Frequency) signals, also called electromagnetic signals. The radio frequency circuitry 304 communicates with communication networks and other communication devices via electromagnetic signals. The rf circuit 304 converts an electrical signal into an electromagnetic signal to transmit, or converts a received electromagnetic signal into an electrical signal. Optionally, the radio frequency circuit 304 comprises: an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card, and so forth. The radio frequency circuitry 304 may communicate with other terminals via at least one wireless communication protocol. The wireless communication protocols include, but are not limited to: metropolitan area networks, various generation mobile communication networks (2G, 3G, 4G, and 5G), Wireless local area networks, and/or WiFi (Wireless Fidelity) networks. In some embodiments, the rf circuit 304 may further include NFC (Near Field Communication) related circuits, which are not limited in this application.
The display screen 305 is used to display a UI (User Interface). The UI may include graphics, text, icons, video, and any combination thereof. When the display screen 305 is a touch display screen, the display screen 305 also has the ability to capture touch signals on or over the surface of the display screen 305. The touch signal may be input to the processor 301 as a control signal for processing. At this point, the display screen 305 may also be used to provide virtual buttons and/or a virtual keyboard, also referred to as soft buttons and/or a soft keyboard. In some embodiments, the display screen 305 may be one, the front panel of the electronic device; in other embodiments, the display screens 305 may be at least two, respectively disposed on different surfaces of the electronic device or in a folded design; in still other embodiments, the display screen 305 may be a flexible display screen disposed on a curved surface or a folded surface of the electronic device. Even further, the display screen 305 may be arranged in a non-rectangular irregular figure, i.e. a shaped screen. The Display screen 305 may be made of LCD (liquid crystal Display), OLED (Organic Light-Emitting Diode), and the like.
The power supply 306 is used to power various components in the electronic device. The power source 306 may be alternating current, direct current, disposable or rechargeable. When the power source 306 includes a rechargeable battery, the rechargeable battery may support wired or wireless charging. The rechargeable battery may also be used to support fast charge technology.
Those skilled in the art will appreciate that the configuration shown in fig. 1 does not constitute a limitation of the terminal device and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.
In addition, the embodiment of the present invention further provides a server, where the structural description of the server refers to the description of the terminal device above, and the structure is similar, and is not described herein again.
In addition, an embodiment of the present invention further provides a computer-readable storage medium, where a television program playing program is stored on the computer-readable storage medium, and when the television program playing program is executed by a processor, the steps of the television program playing method described above are implemented. Therefore, a detailed description thereof will be omitted. In addition, the beneficial effects of the same method are not described in detail. For technical details not disclosed in embodiments of the computer-readable storage medium referred to in the present application, reference is made to the description of embodiments of the method of the present application. It is determined that the program instructions may be deployed to be executed on one terminal device, or on multiple terminal devices located at one site, or distributed across multiple sites and interconnected by a communication network, as examples.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The computer-readable storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.
Based on the hardware structure, the embodiment of the television program playing method is provided.
Referring to fig. 2, fig. 2 is a schematic flowchart of a television program playing method according to a first embodiment of the present invention, where the method is used for a terminal device, and the method includes the following steps:
step S11: and when receiving the target television program, acquiring a key frame of the television program.
It should be noted that, the main execution bodies of the method are the terminal device and the server, the terminal device and the server respectively install the corresponding terminal television program playing program and the server television program playing program, and the terminal device and the server execute the corresponding terminal television program playing program and the server television program playing program, so as to implement the television program playing method of the present invention.
Further, the target television program is a television program composed of GOP sequences; the step of acquiring the key frame of the television program when receiving the target television program comprises the following steps: when a target television program is received, acquiring the GOP sequence in the target television program; determining an I-frame in the GOP sequence as the key frame.
It will be appreciated that the target television program may be a television program, typically video data, viewed by a user (visually impaired), the video data being a group of pictures consisting of a fixed series of encoded picture frames, i.e. the GOP sequence, and the representation of a GOP sequence may be IBBPBBPBBPBBPBB (a GOP sequence comprising a number of frames, the number of which is not a fixed value, related to the specific data of the television program, wherein the number of frames in a GOP sequence is an I-frame and a plurality of alternating B-frames and P-frames). In the present application, a plurality of GOP sequences related to a target television program have been decoded in a memory of a terminal device and are waiting for playing, and the terminal device plays the GOP sequences into continuous pictures according to a time stamp (playing time in a video frame) or a synchronization mechanism.
For example, there are currently two GOP sequences as follows: IBBPBBPBBPBBPBB and IBBPBBPBBPBBPBB, is broadcasting the first GOP sequence IBBPBBPBPBB at present, at this moment, obtain the key frame of the next GOP sequence-frame I, and send this frame I to the server.
In the application, the terminal device plays a group of pictures corresponding to a GOP sequence, and the I frame is usually the first frame in the GOP sequence; the data transmission of the terminal device and the server can be rapid transmission through a millimeter wave (5G) protocol; meanwhile, when a picture group corresponding to one GOP sequence is played, acquiring a key frame of the next GOP sequence, and sending the key frame (and the current time which is the playing time of the key frame) to a server; based on the three points, when the terminal device plays the group of pictures corresponding to one GOP sequence, the server and the terminal device have enough time to perform data transmission and data processing, so that the character description information is obtained before the group of pictures corresponding to the GOP sequence is played, and the reading audio corresponding to the character description information is played.
Step S12: and sending the key frame to a server so that the server can acquire data of the key frame to obtain the text description information.
Further, before step S12, the method further includes: acquiring current time corresponding to the key frame; accordingly, step S12 includes: and sending the key frame and the current time to a server so that the server acquires data of the key frame to obtain picture description information, and marking the picture description information by using the current time to obtain text description information.
The server collects picture description information in the key frame based on a preset algorithm (the preset algorithm can be set by a user based on requirements, and the invention is not limited), wherein the picture description information comprises names of objects in the key frame, position information of the objects and characteristics of the objects (namely, the text description information comprises the current time, the names of the objects in the key frame, the position information of the objects and the characteristics of the objects). The characteristics of the real object may include the size, color, shape, and the like of the object.
The server generates a transmission file according to the acquired text description information according to a preset script, wherein the transmission file can be in a txt format or a json format, and sends the text description information to the terminal equipment in a file transmission mode. The preset script may be in the following format:
step S13: and when receiving the text description information sent by the server, obtaining the reading audio based on the text description information.
Step S14: and playing the reading audio.
It is understood that the textual description information is sent in a txt format or json format file, from which the terminal device extracts the textual description information.
Further, the step of obtaining the reading audio based on the text description information when receiving the text description information sent by the server includes: when the character description information is received, decoding the character description information to obtain natural language information; and when the current time in the text description information is reached, reading the natural language information to obtain the reading audio.
When the text description information is read aloud, the obtained speech corresponding to the encoded data, which is not the natural language audio, needs to decode and translate the text description information to obtain natural language information understandable by a user (visually impaired person) so as to read aloud the natural language information to obtain aloud audio, and at this time, the aloud audio can be understood by the user.
Referring to FIG. 3, FIG. 3 is a diagram illustrating exemplary key frames according to the present invention. The key frame is an I frame in a GOP sequence, and the current time corresponding to the key frame is 08:00:00(24 hours, i.e. 8 am), at this time, the content of picture description information obtained by the server is as follows:
then, the server sends the text description information (including the above picture description information and the target event) to the terminal device in a text (txt format or json format file) transmission mode, and the terminal device decodes and translates the text description information to obtain natural language information:
the character in the current picture is playing golf in the golf course; one adult man who wears blue short sleeves on the upper body and white five-part pants on the lower body at the left side of the picture and has a white peaked cap with a height of about one meter and seven swings a golf club; the bluish sky in the upper third of the picture has many white clouds; the middle third of the picture in the middle of the picture is provided with a plurality of green or brown trees with branches connected with one tree; a small lake is located about 100 meters from an adult male. There are many trees beside the oval lake surface, and the lake water is clear and glistening in the shark; the grassland is in the middle of the lower part of the picture, and the sunlight is scattered on the grassland and has the shadow of a lot of trees.
When the text description information is obtained, when the current time in the text description information is reached, the natural language information is read aloud, and the aloud audio is obtained and comprises the audio corresponding to the text.
The technical scheme of the invention provides a television program playing method which is used for terminal equipment and comprises the following steps: when a target television program is received, acquiring a key frame of the television program; sending the key frame to a server so that the server can acquire data of the key frame to obtain text description information; when receiving the character description information sent by the server, obtaining reading audio based on the character description information; and playing the reading audio.
In the existing television program playing method, technicians are required to manually record description audio of pictures to obtain voice-over, and the voice-over is added into the television program, so that more labor cost is consumed. In the invention, the server acquires the data of the key frame to obtain the text description information, and meanwhile, the terminal equipment obtains the reading audio based on the text description information without manually recording the description audio by technicians, so that the labor cost is saved, and the user experience is better.
In addition, in the existing method, a technician manually inputs voice description (voice over. In the invention, the server can update the algorithm at any time, so that the obtained text description information is not lagged, the comprehensiveness is better, and the user experience is better.
Referring to fig. 4, fig. 4 is a schematic flowchart of a television program playing method according to a second embodiment of the present invention, where the method is used in a server, and the method includes the following steps:
step S21: receiving a key frame sent by terminal equipment, wherein the key frame is acquired in a television program when a target program is received;
step S22: acquiring data of the key frame to obtain text description information;
step S23: and sending the text description information to the terminal equipment so that the terminal equipment obtains a reading audio based on the text description information and plays the reading audio.
Further, before the step of acquiring data of the key frame and obtaining the text description information, the method further includes: receiving the current time sent by the terminal equipment, wherein the current time corresponds to the key frame; correspondingly, the step of acquiring data of the key frame to obtain the text description information includes: acquiring data of the key frame to obtain picture description information; and marking the picture description information by using the current time to obtain the text description information.
The content description of the server side refers to the above description, and is not repeated here.
Referring to fig. 5, fig. 5 is a block diagram of a first embodiment of a television program playing apparatus according to the present invention, where the apparatus is used for a terminal device, and the apparatus includes:
a first receiving module 10, configured to obtain a key frame of a television program when receiving a target television program;
the first sending module 20 is configured to send the key frame to a server, so that the server performs data acquisition on the key frame to obtain text description information;
an obtaining module 30, configured to obtain a reading audio based on the text description information when receiving the text description information sent by the server;
and the playing module 40 is used for playing the reading audio.
Referring to fig. 6, fig. 6 is a block diagram of a first embodiment of a tv program playing apparatus according to the present invention, the apparatus is used for a server, and the apparatus includes:
a second receiving module 50, configured to receive a key frame sent by a terminal device, where the key frame is obtained in a television program when a target program is received;
an acquisition module 60, configured to perform data acquisition on the key frame to obtain text description information;
and a second sending module 70, configured to send the text description information to the terminal device, so that the terminal device obtains a reading audio based on the text description information, and plays the reading audio.
The above description is only an alternative embodiment of the present invention, and not intended to limit the scope of the present invention, and all modifications and equivalents of the present invention, which are made by the contents of the present specification and the accompanying drawings, or directly/indirectly applied to other related technical fields, are included in the scope of the present invention.
Claims (10)
1. A method for playing a television program, which is used for a terminal device, the method comprising the steps of:
when a target television program is received, acquiring a key frame of the television program;
sending the key frame to a server so that the server can acquire data of the key frame to obtain text description information;
when receiving the character description information sent by the server, obtaining reading audio based on the character description information;
and playing the reading audio.
2. The method of claim 1, wherein prior to the step of sending the key frame to a server, the method further comprises:
acquiring current time corresponding to the key frame;
the step of sending the key frame to a server includes:
and sending the key frame and the current time to a server so that the server acquires data of the key frame to obtain picture description information, and marking the picture description information by using the current time to obtain text description information.
3. The method of claim 2, wherein the target television program is a television program consisting of a sequence of GOPs; the step of acquiring the key frame of the television program when receiving the target television program comprises the following steps:
when a target television program is received, acquiring the GOP sequence in the target television program;
determining an I-frame in the GOP sequence as the key frame.
4. The method of claim 3, wherein the step of obtaining speakable audio based on the textual description information upon receiving the textual description information sent by the server comprises:
when the character description information is received, decoding the character description information to obtain natural language information;
and when the current time in the text description information is reached, reading the natural language information to obtain the reading audio.
5. A method for playing back a television program, the method being used for a server, the method comprising the steps of:
receiving a key frame sent by terminal equipment, wherein the key frame is acquired in a television program when a target program is received;
acquiring data of the key frame to obtain text description information;
and sending the text description information to the terminal equipment so that the terminal equipment obtains a reading audio based on the text description information and plays the reading audio.
6. The method of claim 5, wherein before the step of collecting data from the key frames to obtain textual description information, the method further comprises:
receiving the current time sent by the terminal equipment, wherein the current time corresponds to the key frame;
the step of acquiring data of the key frame to obtain the text description information comprises the following steps:
acquiring data of the key frame to obtain picture description information;
and marking the picture description information by using the current time to obtain the text description information.
7. The method of claim 6, wherein the textual description information includes the current time, a name of a thing in the key frame, location information of the thing, and a feature of the thing.
8. A terminal device, characterized in that the terminal device comprises: memory, processor and a television program stored on said memory and running on said processor, said television program playing program when executed by said processor implementing the steps of the television program playing method as claimed in any one of claims 1 to 4.
9. A server, characterized in that the server comprises: memory, processor and a television program stored on said memory and running on said processor, said television program playing program when executed by said processor implementing the steps of the television program playing method as claimed in any one of claims 5 to 7.
10. A computer-readable storage medium, on which a television program playing program is stored, which when executed by a processor implements the steps of the television program playing method according to any one of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110427756.4A CN113225615B (en) | 2021-04-20 | 2021-04-20 | Television program playing method, terminal equipment, server and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110427756.4A CN113225615B (en) | 2021-04-20 | 2021-04-20 | Television program playing method, terminal equipment, server and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113225615A true CN113225615A (en) | 2021-08-06 |
CN113225615B CN113225615B (en) | 2023-08-08 |
Family
ID=77088080
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110427756.4A Active CN113225615B (en) | 2021-04-20 | 2021-04-20 | Television program playing method, terminal equipment, server and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113225615B (en) |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5677739A (en) * | 1995-03-02 | 1997-10-14 | National Captioning Institute | System and method for providing described television services |
CN101286274A (en) * | 2008-05-08 | 2008-10-15 | 李卫红 | Digital video automatic explaining system for blind men |
CN101458951A (en) * | 2008-12-30 | 2009-06-17 | 胡礼斌 | Video and audio program signal processing system having multiple functions |
US20090259473A1 (en) * | 2008-04-14 | 2009-10-15 | Chang Hisao M | Methods and apparatus to present a video program to a visually impaired person |
US20130107121A1 (en) * | 2011-10-28 | 2013-05-02 | Sony Corporation, A Japanese Corporation | Audio description availability notifier |
CN104980790A (en) * | 2015-06-30 | 2015-10-14 | 北京奇艺世纪科技有限公司 | Voice subtitle generating method and apparatus, and playing method and apparatus |
WO2018121001A1 (en) * | 2016-12-30 | 2018-07-05 | 深圳市九洲电器有限公司 | Method and system for outputting simultaneous interpretation of digital television program, and smart terminal |
CN109275027A (en) * | 2018-09-26 | 2019-01-25 | Tcl海外电子(惠州)有限公司 | Speech output method, electronic playback devices and the storage medium of video |
CN109672932A (en) * | 2018-12-29 | 2019-04-23 | 深圳Tcl新技术有限公司 | Assist method, system, equipment and the storage medium of people with visual impairment viewing video |
CN110519636A (en) * | 2019-09-04 | 2019-11-29 | 腾讯科技(深圳)有限公司 | Voice messaging playback method, device, computer equipment and storage medium |
CN111046223A (en) * | 2019-11-14 | 2020-04-21 | 李秉伦 | Voice assisting method, terminal, server and system for visually impaired |
CN111538862A (en) * | 2020-05-15 | 2020-08-14 | 北京百度网讯科技有限公司 | Method and device for explaining video |
CN112087672A (en) * | 2020-08-13 | 2020-12-15 | 浙江大学 | Video stream description generation method using intelligent terminal and server |
-
2021
- 2021-04-20 CN CN202110427756.4A patent/CN113225615B/en active Active
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5677739A (en) * | 1995-03-02 | 1997-10-14 | National Captioning Institute | System and method for providing described television services |
US20090259473A1 (en) * | 2008-04-14 | 2009-10-15 | Chang Hisao M | Methods and apparatus to present a video program to a visually impaired person |
CN101286274A (en) * | 2008-05-08 | 2008-10-15 | 李卫红 | Digital video automatic explaining system for blind men |
CN101458951A (en) * | 2008-12-30 | 2009-06-17 | 胡礼斌 | Video and audio program signal processing system having multiple functions |
US20130107121A1 (en) * | 2011-10-28 | 2013-05-02 | Sony Corporation, A Japanese Corporation | Audio description availability notifier |
CN104980790A (en) * | 2015-06-30 | 2015-10-14 | 北京奇艺世纪科技有限公司 | Voice subtitle generating method and apparatus, and playing method and apparatus |
WO2018121001A1 (en) * | 2016-12-30 | 2018-07-05 | 深圳市九洲电器有限公司 | Method and system for outputting simultaneous interpretation of digital television program, and smart terminal |
CN109275027A (en) * | 2018-09-26 | 2019-01-25 | Tcl海外电子(惠州)有限公司 | Speech output method, electronic playback devices and the storage medium of video |
CN109672932A (en) * | 2018-12-29 | 2019-04-23 | 深圳Tcl新技术有限公司 | Assist method, system, equipment and the storage medium of people with visual impairment viewing video |
CN110519636A (en) * | 2019-09-04 | 2019-11-29 | 腾讯科技(深圳)有限公司 | Voice messaging playback method, device, computer equipment and storage medium |
CN111046223A (en) * | 2019-11-14 | 2020-04-21 | 李秉伦 | Voice assisting method, terminal, server and system for visually impaired |
CN111538862A (en) * | 2020-05-15 | 2020-08-14 | 北京百度网讯科技有限公司 | Method and device for explaining video |
CN112087672A (en) * | 2020-08-13 | 2020-12-15 | 浙江大学 | Video stream description generation method using intelligent terminal and server |
Also Published As
Publication number | Publication date |
---|---|
CN113225615B (en) | 2023-08-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113766060B (en) | An information screen display method, electronic device and computer-readable storage medium | |
WO2022199352A1 (en) | Method for always on displaying, and electronic device | |
CN111448587A (en) | Display method, uploading method and device of advertisement pictures | |
CN112689172B (en) | Program playing method and device, set top box and storage medium | |
CN113163141A (en) | Display control method, display control device, television and computer-readable storage medium | |
CN113014830A (en) | Video blurring method, device, equipment and storage medium | |
CN112612526B (en) | Application program control method, device, terminal equipment and storage medium | |
CN113038232A (en) | Video playing method, device, equipment, server and storage medium | |
CN113099300B (en) | Program playing method, device, display terminal and storage medium | |
CN113225615B (en) | Television program playing method, terminal equipment, server and storage medium | |
CN114047890A (en) | Screen projection method, screen projection device, playing device, screen projection system and storage medium | |
CN112770167A (en) | Video display method and device, intelligent display terminal and storage medium | |
CN114495859B (en) | Picture display method, device, display terminal and storage medium | |
CN112437333B (en) | Program playing method, device, terminal equipment and storage medium | |
CN114173172B (en) | Data processing method, device, terminal equipment and storage medium | |
CN113538048A (en) | Demand information obtaining method and device, terminal equipment and storage medium | |
CN112532971A (en) | Image processing method, image processing device, display terminal and computer readable storage medium | |
CN112584215B (en) | Video transmission method and device, intelligent television and storage medium | |
CN112423004B (en) | Video data transmission method, device, transmitting end and storage medium | |
CN112911403B (en) | Event analysis method and device, television and computer readable storage medium | |
CN112349248B (en) | Screen brightness adjusting method and device, multimedia terminal and computer readable storage medium | |
CN112423037A (en) | Television program playing method, device, terminal equipment and computer storage medium | |
CN112218119B (en) | Video stream frame rate conversion method, device, terminal equipment and storage medium | |
CN113179451A (en) | Television control method and device, television and computer readable storage medium | |
CN118363692A (en) | Weather information-based display method, electronic device and readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |