CN114787844A - Model training method, video processing method, device, storage medium and electronic device - Google Patents
Model training method, video processing method, device, storage medium and electronic device Download PDFInfo
- Publication number
- CN114787844A CN114787844A CN202080084487.XA CN202080084487A CN114787844A CN 114787844 A CN114787844 A CN 114787844A CN 202080084487 A CN202080084487 A CN 202080084487A CN 114787844 A CN114787844 A CN 114787844A
- Authority
- CN
- China
- Prior art keywords
- model
- classification
- sample
- audio
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title abstract 4
- 238000003672 processing method Methods 0.000 title 1
- 238000000605 extraction Methods 0.000 abstract 6
- 238000013145 classification model Methods 0.000 abstract 4
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation; Time management
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Entrepreneurship & Innovation (AREA)
- Human Resources & Organizations (AREA)
- Operations Research (AREA)
- Economics (AREA)
- Marketing (AREA)
- Data Mining & Analysis (AREA)
- Quality & Reliability (AREA)
- Tourism & Hospitality (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Image Analysis (AREA)
Abstract
The embodiment of the application discloses a model training method, a video processing device, a storage medium and electronic equipment. The model training method comprises the following steps: the method comprises the steps of obtaining a video sample and a classification label corresponding to the video sample, and dividing the video sample into an image sample and an audio sample; constructing a basic model, wherein the basic model comprises an image feature extraction model, an audio feature extraction model and a classification model; extracting image characteristics of the image sample through an image characteristic extraction model, and extracting audio characteristics of the audio sample through an audio characteristic extraction model; inputting the image characteristics and the audio characteristics into a classification model for classification to obtain a prediction label corresponding to a video sample; and adjusting parameters of the image feature extraction model, the audio feature extraction model and the classification model according to the difference between the prediction label and the classification label until the basic model converges, and taking the converged basic model as a video classification model for video classification.
Description
PCT国内申请,说明书已公开。PCT domestic application, the description has been published.
Claims (19)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2020/071021 WO2021138855A1 (en) | 2020-01-08 | 2020-01-08 | Model training method, video processing method and apparatus, storage medium and electronic device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114787844A true CN114787844A (en) | 2022-07-22 |
Family
ID=76787433
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202080084487.XA Pending CN114787844A (en) | 2020-01-08 | 2020-01-08 | Model training method, video processing method, device, storage medium and electronic device |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114787844A (en) |
WO (1) | WO2021138855A1 (en) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113408664B (en) * | 2021-07-20 | 2024-04-16 | 北京百度网讯科技有限公司 | Training method, classification method, device, electronic equipment and storage medium |
CN113672252B (en) * | 2021-07-23 | 2024-07-12 | 浙江大华技术股份有限公司 | Model upgrading method, video monitoring system, electronic equipment and readable storage medium |
CN113806536B (en) * | 2021-09-14 | 2024-04-16 | 广州华多网络科技有限公司 | Text classification method and device, equipment, medium and product thereof |
CN114283350B (en) * | 2021-09-17 | 2024-06-07 | 腾讯科技(深圳)有限公司 | Visual model training and video processing method, device, equipment and storage medium |
CN113807281B (en) * | 2021-09-23 | 2024-03-29 | 深圳信息职业技术学院 | Image detection model generation method, detection method, terminal and storage medium |
CN113887505B (en) * | 2021-10-22 | 2025-04-29 | 镇江宏祥自动化科技有限公司 | Cattle image classification method, device, electronic device and storage medium |
CN114170425B (en) * | 2021-11-02 | 2025-01-07 | 阿里巴巴(中国)有限公司 | Model training, image classification methods, servers and storage media |
CN114443899B (en) * | 2022-01-28 | 2024-12-31 | 腾讯科技(深圳)有限公司 | Video classification method, device, equipment and medium |
CN114528762B (en) * | 2022-02-17 | 2024-02-20 | 腾讯科技(深圳)有限公司 | Model training method, device, equipment and storage medium |
CN114705467B (en) * | 2022-04-01 | 2025-04-25 | 深圳市玄羽科技有限公司 | An artificial intelligence mechanical equipment monitoring system and monitoring method |
CN114821401B (en) * | 2022-04-07 | 2024-08-27 | 腾讯科技(深圳)有限公司 | Video auditing method, device, equipment, storage medium and program product |
CN116996680B (en) * | 2023-09-26 | 2023-12-12 | 上海视龙软件有限公司 | Method and device for training video data classification model |
CN118570567B (en) * | 2024-08-02 | 2024-10-15 | 浙江省国土空间规划研究院 | Planning intent graph generation method and system based on image generation model |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050125223A1 (en) * | 2003-12-05 | 2005-06-09 | Ajay Divakaran | Audio-visual highlights detection using coupled hidden markov models |
US20150082349A1 (en) * | 2013-09-13 | 2015-03-19 | Arris Enterprises, Inc. | Content Based Video Content Segmentation |
CN109214374A (en) * | 2018-11-06 | 2019-01-15 | 北京达佳互联信息技术有限公司 | Video classification methods, device, server and computer readable storage medium |
CN109257622A (en) * | 2018-11-01 | 2019-01-22 | 广州市百果园信息技术有限公司 | A kind of audio/video processing method, device, equipment and medium |
CN110263217A (en) * | 2019-06-28 | 2019-09-20 | 北京奇艺世纪科技有限公司 | A kind of video clip label identification method and device |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104866596B (en) * | 2015-05-29 | 2018-09-14 | 北京邮电大学 | A kind of video classification methods and device based on autocoder |
CN109344781A (en) * | 2018-10-11 | 2019-02-15 | 上海极链网络科技有限公司 | Expression recognition method in a kind of video based on audio visual union feature |
CN109840509B (en) * | 2019-02-15 | 2020-12-01 | 北京工业大学 | Multi-level collaborative identification method and device for bad anchors in online live video |
-
2020
- 2020-01-08 WO PCT/CN2020/071021 patent/WO2021138855A1/en active Application Filing
- 2020-01-08 CN CN202080084487.XA patent/CN114787844A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050125223A1 (en) * | 2003-12-05 | 2005-06-09 | Ajay Divakaran | Audio-visual highlights detection using coupled hidden markov models |
US20150082349A1 (en) * | 2013-09-13 | 2015-03-19 | Arris Enterprises, Inc. | Content Based Video Content Segmentation |
CN109257622A (en) * | 2018-11-01 | 2019-01-22 | 广州市百果园信息技术有限公司 | A kind of audio/video processing method, device, equipment and medium |
CN109214374A (en) * | 2018-11-06 | 2019-01-15 | 北京达佳互联信息技术有限公司 | Video classification methods, device, server and computer readable storage medium |
CN110263217A (en) * | 2019-06-28 | 2019-09-20 | 北京奇艺世纪科技有限公司 | A kind of video clip label identification method and device |
Also Published As
Publication number | Publication date |
---|---|
WO2021138855A1 (en) | 2021-07-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114787844A (en) | Model training method, video processing method, device, storage medium and electronic device | |
US11776530B2 (en) | Speech model personalization via ambient context harvesting | |
CN110782878B (en) | Attention mechanism-based multi-scale audio scene recognition method | |
WO2020088216A1 (en) | Audio and video processing method and device, apparatus, and medium | |
CN114424253A (en) | Model training method and device, storage medium and electronic equipment | |
CN111095293A (en) | Image aesthetic processing method and electronic equipment | |
CN108694949B (en) | Speaker identification method and device based on reordering supervectors and residual error network | |
CN113841179A (en) | Image generation method and device, electronic device and storage medium | |
CN115699082A (en) | Defect detection method and device, storage medium and electronic equipment | |
CN110830807B (en) | Image compression method, device and storage medium | |
CN114556469A (en) | Data processing method and device, electronic equipment and storage medium | |
US20230335148A1 (en) | Speech Separation Method, Electronic Device, Chip, and Computer-Readable Storage Medium | |
JP6274067B2 (en) | Information processing apparatus and information processing method | |
CN114761975A (en) | Data processing method and device, communication equipment and storage medium | |
CN114375466A (en) | Video scoring method, device, storage medium and electronic device | |
CN115104151A (en) | An offline speech recognition method and apparatus, electronic device and readable storage medium | |
CN116453023B (en) | Video abstraction system, method, electronic equipment and medium for 5G rich media information | |
CN111868823A (en) | A sound source separation method, device and equipment | |
CN107992937A (en) | Unstructured data decision method and device based on deep learning | |
CN113053361B (en) | Speech recognition method, model training method, device, equipment and medium | |
JP2025102754A (en) | Adaptive Visual Speech Recognition | |
CN111263946A (en) | Object recognition method and computer readable storage medium | |
WO2019127940A1 (en) | Video classification model training method, device, storage medium, and electronic device | |
EP0953968A3 (en) | Speaker and environment adaptation based on eigenvoices including maximum likelihood method | |
CN103390403A (en) | Extraction method and device for mel frequency cepstrum coefficient (MFCC) characteristics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |