CN114787844A - Model training method, video processing method, device, storage medium and electronic device - Google Patents

Info

Publication number
CN114787844A
Authority
CN
China
Prior art keywords
model
classification
sample
audio
video
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202080084487.XA
Other languages
Chinese (zh)
Inventor
郭子亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Shenzhen Huantai Technology Co Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Shenzhen Huantai Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-01-08
Filing date
2020-01-08
Publication date
2022-07-22
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd, Shenzhen Huantai Technology Co Ltd
Publication of CN114787844A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Resources & Organizations (AREA)
  • Operations Research (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

The embodiments of the application disclose a model training method, a video processing method and device, a storage medium and an electronic device. The model training method comprises: obtaining a video sample and its corresponding classification label, and dividing the video sample into an image sample and an audio sample; constructing a basic model comprising an image feature extraction model, an audio feature extraction model and a classification model; extracting image features of the image sample through the image feature extraction model, and audio features of the audio sample through the audio feature extraction model; inputting the image features and the audio features into the classification model to obtain a prediction label for the video sample; and adjusting the parameters of the image feature extraction model, the audio feature extraction model and the classification model according to the difference between the prediction label and the classification label until the basic model converges, the converged basic model then serving as a video classification model for video classification.
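
The training steps in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the application: the names `BasicModel`, `train_step` and `train` are hypothetical, each feature extraction model is a single linear layer with a tanh activation, the classification model is a softmax layer over the concatenated features, the "difference" between prediction and label is measured with cross-entropy, convergence is judged by the change in mean loss, and the image and audio samples are assumed to be already separated from the video and given as small numeric vectors.

```python
import math
import random

def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def init(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

class BasicModel:
    """Image extractor + audio extractor + classifier, trained jointly (toy stand-in)."""

    def __init__(self, img_dim, aud_dim, feat_dim, n_classes, seed=0):
        rng = random.Random(seed)
        self.W_img = init(feat_dim, img_dim, rng)        # image feature extraction model
        self.W_aud = init(feat_dim, aud_dim, rng)        # audio feature extraction model
        self.W_cls = init(n_classes, 2 * feat_dim, rng)  # classification model

    def forward(self, image, audio):
        f_img = [math.tanh(v) for v in matvec(self.W_img, image)]
        f_aud = [math.tanh(v) for v in matvec(self.W_aud, audio)]
        feats = f_img + f_aud                            # concatenate both modalities
        return feats, softmax(matvec(self.W_cls, feats))

    def predict(self, image, audio):
        probs = self.forward(image, audio)[1]
        return max(range(len(probs)), key=probs.__getitem__)

    def train_step(self, image, audio, label, lr):
        feats, probs = self.forward(image, audio)
        loss = -math.log(probs[label] + 1e-12)           # cross-entropy vs. the class label
        # Gradient of the loss with respect to the classifier logits.
        d_logits = [p - (1.0 if c == label else 0.0) for c, p in enumerate(probs)]
        # Gradient with respect to the features, computed before W_cls is changed.
        d_feats = [sum(d_logits[c] * self.W_cls[c][j] for c in range(len(d_logits)))
                   for j in range(len(feats))]
        # Adjust the classification model.
        for c, g in enumerate(d_logits):
            for j, f in enumerate(feats):
                self.W_cls[c][j] -= lr * g * f
        # Back-propagate through tanh and adjust both feature extraction models.
        k = len(feats) // 2
        for W, x, offset in ((self.W_img, image, 0), (self.W_aud, audio, k)):
            for i in range(k):
                g = d_feats[offset + i] * (1.0 - feats[offset + i] ** 2)
                for j, xj in enumerate(x):
                    W[i][j] -= lr * g * xj
        return loss

def train(model, samples, lr=0.1, tol=1e-5, max_epochs=2000):
    """samples: list of (image, audio, label). Stops once the mean loss change is below tol."""
    prev = mean_loss = float("inf")
    for _ in range(max_epochs):
        mean_loss = sum(model.train_step(i, a, y, lr) for i, a, y in samples) / len(samples)
        if abs(prev - mean_loss) < tol:
            break
        prev = mean_loss
    return mean_loss

# Toy data: (image vector, audio vector, classification label).
samples = [
    ([1.0, 0.0, 0.0, 1.0], [0.9, 0.1], 0),
    ([0.0, 1.0, 1.0, 0.0], [0.1, 0.8], 1),
    ([0.9, 0.1, 0.0, 0.8], [1.0, 0.0], 0),
    ([0.1, 0.9, 0.8, 0.1], [0.0, 1.0], 1),
]
model = BasicModel(img_dim=4, aud_dim=2, feat_dim=3, n_classes=2)
final_loss = train(model, samples)
```

After training, `model.predict(image, audio)` plays the role of the converged video classification model. In practice the two extractors would likely be deep networks operating on decoded frames and audio signals; the sketch only shows that a single loss drives the parameter updates of all three sub-models.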

Description

PCT domestic application; the description has been published.

Claims (19)

PCT domestic application; the claims have been published.
CN202080084487.XA 2020-01-08 2020-01-08 Model training method, video processing method, device, storage medium and electronic device Pending CN114787844A (en)

Applications Claiming Priority (1)

Application Number Publication Number Priority Date Filing Date Title
PCT/CN2020/071021 WO2021138855A1 (en) 2020-01-08 2020-01-08 Model training method, video processing method and apparatus, storage medium and electronic device

Publications (1)

Publication Number Publication Date
CN114787844A 2022-07-22

Family

ID=76787433

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
CN202080084487.XA Pending CN114787844A (en) 2020-01-08 2020-01-08 Model training method, video processing method, device, storage medium and electronic device

Country Status (2)

Country Link
CN (1) CN114787844A (en)
WO (1) WO2021138855A1 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113408664B (en) * 2021-07-20 2024-04-16 北京百度网讯科技有限公司 Training method, classification method, device, electronic equipment and storage medium
CN113672252B (en) * 2021-07-23 2024-07-12 浙江大华技术股份有限公司 Model upgrading method, video monitoring system, electronic equipment and readable storage medium
CN113806536B (en) * 2021-09-14 2024-04-16 广州华多网络科技有限公司 Text classification method and device, equipment, medium and product thereof
CN114283350B (en) * 2021-09-17 2024-06-07 腾讯科技(深圳)有限公司 Visual model training and video processing method, device, equipment and storage medium
CN113807281B (en) * 2021-09-23 2024-03-29 深圳信息职业技术学院 Image detection model generation method, detection method, terminal and storage medium
CN113887505B (en) * 2021-10-22 2025-04-29 镇江宏祥自动化科技有限公司 Cattle image classification method, device, electronic device and storage medium
CN114170425B (en) * 2021-11-02 2025-01-07 阿里巴巴(中国)有限公司 Model training, image classification methods, servers and storage media
CN114443899B (en) * 2022-01-28 2024-12-31 腾讯科技(深圳)有限公司 Video classification method, device, equipment and medium
CN114528762B (en) * 2022-02-17 2024-02-20 腾讯科技(深圳)有限公司 Model training method, device, equipment and storage medium
CN114705467B (en) * 2022-04-01 2025-04-25 深圳市玄羽科技有限公司 An artificial intelligence mechanical equipment monitoring system and monitoring method
CN114821401B (en) * 2022-04-07 2024-08-27 腾讯科技(深圳)有限公司 Video auditing method, device, equipment, storage medium and program product
CN116996680B (en) * 2023-09-26 2023-12-12 上海视龙软件有限公司 Method and device for training video data classification model
CN118570567B (en) * 2024-08-02 2024-10-15 浙江省国土空间规划研究院 Planning intent graph generation method and system based on image generation model

Citations (5)

Publication number Priority date Publication date Assignee Title
US20050125223A1 (en) * 2003-12-05 2005-06-09 Ajay Divakaran Audio-visual highlights detection using coupled hidden markov models
US20150082349A1 (en) * 2013-09-13 2015-03-19 Arris Enterprises, Inc. Content Based Video Content Segmentation
CN109214374A (en) * 2018-11-06 2019-01-15 北京达佳互联信息技术有限公司 Video classification methods, device, server and computer readable storage medium
CN109257622A (en) * 2018-11-01 2019-01-22 广州市百果园信息技术有限公司 A kind of audio/video processing method, device, equipment and medium
CN110263217A (en) * 2019-06-28 2019-09-20 北京奇艺世纪科技有限公司 A kind of video clip label identification method and device

Family Cites Families (3)

Publication number Priority date Publication date Assignee Title
CN104866596B (en) * 2015-05-29 2018-09-14 北京邮电大学 A kind of video classification methods and device based on autocoder
CN109344781A (en) * 2018-10-11 2019-02-15 上海极链网络科技有限公司 Expression recognition method in a kind of video based on audio visual union feature
CN109840509B (en) * 2019-02-15 2020-12-01 北京工业大学 Multi-level collaborative identification method and device for bad anchors in online live video

Also Published As

Publication number Publication date
WO2021138855A1 (en) 2021-07-15

Similar Documents

Publication Publication Date Title
CN114787844A (en) Model training method, video processing method, device, storage medium and electronic device
US11776530B2 (en) Speech model personalization via ambient context harvesting
CN110782878B (en) Attention mechanism-based multi-scale audio scene recognition method
WO2020088216A1 (en) Audio and video processing method and device, apparatus, and medium
CN114424253A (en) Model training method and device, storage medium and electronic equipment
CN111095293A (en) Image aesthetic processing method and electronic equipment
CN108694949B (en) Speaker identification method and device based on reordering supervectors and residual error network
CN113841179A (en) Image generation method and device, electronic device and storage medium
CN115699082A (en) Defect detection method and device, storage medium and electronic equipment
CN110830807B (en) Image compression method, device and storage medium
CN114556469A (en) Data processing method and device, electronic equipment and storage medium
US20230335148A1 (en) Speech Separation Method, Electronic Device, Chip, and Computer-Readable Storage Medium
JP6274067B2 (en) Information processing apparatus and information processing method
CN114761975A (en) Data processing method and device, communication equipment and storage medium
CN114375466A (en) Video scoring method, device, storage medium and electronic device
CN115104151A (en) An offline speech recognition method and apparatus, electronic device and readable storage medium
CN116453023B (en) Video abstraction system, method, electronic equipment and medium for 5G rich media information
CN111868823A (en) A sound source separation method, device and equipment
CN107992937A (en) Unstructured data decision method and device based on deep learning
CN113053361B (en) Speech recognition method, model training method, device, equipment and medium
JP2025102754A (en) Adaptive Visual Speech Recognition
CN111263946A (en) Object recognition method and computer readable storage medium
WO2019127940A1 (en) Video classification model training method, device, storage medium, and electronic device
EP0953968A3 (en) Speaker and environment adaptation based on eigenvoices including maximum likelihood method
CN103390403A (en) Extraction method and device for mel frequency cepstrum coefficient (MFCC) characteristics

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination