IN2013CH01043A - Google Patents

Publication number
IN2013CH01043A
Authority
IN
India
Prior art keywords
visual object
object detector
visual
class
media content
Prior art date
Application number
Inventor
Jain Vidit
Sudhakar Farfade Sachin
Original Assignee
Yahoo Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yahoo Inc
Priority to IN1043CH2013 (IN2013CH01043A)
Priority to US13/894,814 (US9519659B2)
Publication of IN2013CH01043A
Priority to US15/375,868 (US10007838B2)
Priority to US16/016,137 (US10176364B2)

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774 Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26 Visual data mining; Browsing structured data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computing arrangements based on specific mathematical models
    • G06N7/01 Probabilistic graphical models, e.g. probabilistic networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161 Detection; Localisation; Normalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Multimedia (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Library & Information Science (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Medical Informatics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Computational Mathematics (AREA)
  • Algebra (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)

Abstract

Disclosed herein are a system, method, and architecture for media content enrichment. A visual object detector is trained using a training data set and an existing visual object detector. The resulting adapted visual object detector may be used to detect visual objects belonging to a given class of visual objects. The existing object detector used in training detects a class of visual objects different from the class detected by the adapted object detector. A media content item depicting a visual object detected using the adapted object detector may be associated with metadata, a tag, or other information about the detected visual object to enrich the media content item.
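The adaptation described in the abstract can be illustrated with a minimal toy sketch. It is not the method claimed in the patent, whose training procedure is not given on this page. The sketch assumes a linear detector over hand-made two-dimensional feature vectors, and swaps in a simple transfer-learning scheme: logistic-regression training that starts from the existing detector's weights and is penalised for drifting away from them. All names (`score`, `adapt_detector`, `enrich`, `EXISTING_WEIGHTS`, `TRAINING_SET`) and all numeric values are hypothetical.

```python
import math

# Hypothetical weights of an existing linear detector for class A
# (assumption: it fires when feature 0 dominates feature 1).
EXISTING_WEIGHTS = [1.0, -1.0]

# Hypothetical training set for the new class B: (features, label),
# label 1 = depicts a class-B object, label 0 = does not.
TRAINING_SET = [
    ([0.1, 0.9], 1),
    ([0.2, 0.8], 1),
    ([0.9, 0.1], 0),
    ([0.8, 0.2], 0),
]


def score(weights, features):
    """Linear detector score; a positive score counts as a detection."""
    return sum(w * x for w, x in zip(weights, features))


def adapt_detector(existing_weights, training_set, reg=0.01, lr=0.5, epochs=200):
    """Train a detector for a new class, starting from an existing detector.

    Gradient descent on the mean logistic loss plus a penalty
    (reg / 2) * ||w - existing_weights||^2 that keeps the adapted
    weights close to the existing ones.
    """
    weights = list(existing_weights)
    n = len(training_set)
    for _ in range(epochs):
        # Gradient of the penalty term.
        grad = [reg * (w - w0) for w, w0 in zip(weights, existing_weights)]
        # Gradient of the mean logistic loss over the training set.
        for features, label in training_set:
            prob = 1.0 / (1.0 + math.exp(-score(weights, features)))
            error = prob - label
            for i, x in enumerate(features):
                grad[i] += error * x / n
        weights = [w - lr * g for w, g in zip(weights, grad)]
    return weights


def enrich(media_item, weights, tag, threshold=0.0):
    """Return a copy of media_item, tagged if the detector fires on it."""
    enriched = dict(media_item)
    tags = list(media_item.get("tags", []))
    if score(weights, media_item["features"]) > threshold and tag not in tags:
        tags.append(tag)
    enriched["tags"] = tags
    return enriched


adapted_weights = adapt_detector(EXISTING_WEIGHTS, TRAINING_SET)
photo = {"id": "photo-1", "features": [0.15, 0.85]}
print(enrich(photo, adapted_weights, "class-B object"))
```

In this sketch the enrichment step simply appends a tag to the item's metadata when the adapted detector's score exceeds the threshold; a real system would operate on image features and richer metadata.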
Application IN1043CH2013, priority date 2013-03-12, filing date 2013-03-12, published as IN2013CH01043A (en)

Priority Applications (4)

Application Number | Publication | Priority Date | Filing Date | Title
IN1043CH2013 | IN2013CH01043A (en) | 2013-03-12 | 2013-03-12 |
US13/894,814 | US9519659B2 (en) | 2013-03-12 | 2013-05-15 | Media content enrichment using an adapted object detector
US15/375,868 | US10007838B2 (en) | 2013-03-12 | 2016-12-12 | Media content enrichment using an adapted object detector
US16/016,137 | US10176364B2 (en) | 2013-03-12 | 2018-06-22 | Media content enrichment using an adapted object detector

Applications Claiming Priority (1)

Application Number | Publication | Priority Date | Filing Date | Title
IN1043CH2013 | IN2013CH01043A (en) | 2013-03-12 | 2013-03-12 |

Publications (1)

Publication Number | Publication Date
IN2013CH01043A (en) | 2015-08-14

Family

ID=51525363

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
IN1043CH2013 | IN2013CH01043A (en) | 2013-03-12 | 2013-03-12

Country Status (2)

Country | Link
US (3) | US9519659B2 (en)
IN (1) | IN2013CH01043A (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN104995662B (en) * | 2013-03-20 | 2020-08-11 | 英特尔公司 | Apparatus and method for managing avatar and apparatus for animating avatar
US10867328B2 (en) | 2016-05-03 | 2020-12-15 | Yembo, Inc. | Systems and methods for providing AI-based cost estimates for services
EP3452972A4 | 2016-05-03 | 2020-01-01 | Yembo, Inc. | Systems and methods for providing AI-based cost estimates for services
US10552473B2 (en) * | 2016-08-31 | 2020-02-04 | Facebook, Inc. | Systems and methods for processing media content that depict objects
CA3055381A1 (en) * | 2017-03-10 | 2018-09-13 | Walmart Apollo, Llc | System and method for "always on" offline transaction collection
CN107341485B (en) * | 2017-07-28 | 2019-12-31 | 江汉大学 | Face recognition method and device
US10185628B1 (en) * | 2017-12-07 | 2019-01-22 | Cisco Technology, Inc. | System and method for prioritization of data file backups
CN108236784B (en) * | 2018-01-22 | 2021-09-24 | 腾讯科技(深圳)有限公司 | Model training method and device, storage medium and electronic device
US10664966B2 (en) * | 2018-01-25 | 2020-05-26 | International Business Machines Corporation | Anomaly detection using image-based physical characterization
CN109829481B (en) * | 2019-01-04 | 2020-10-30 | 北京邮电大学 | An image classification method, apparatus, electronic device and readable storage medium
US11068718B2 (en) * | 2019-01-09 | 2021-07-20 | International Business Machines Corporation | Attribute classifiers for image classification
US11295167B2 (en) * | 2020-04-27 | 2022-04-05 | Toshiba Global Commerce Solutions Holdings Corporation | Automated image curation for machine learning deployments
US11960569B2 (en) * | 2021-06-29 | 2024-04-16 | 7-Eleven, Inc. | System and method for refining an item identification model based on feedback
CN117557503B (en) * | 2023-10-25 | 2024-06-25 | 维克多精密工业(深圳)有限公司 | Thermal forming die detection method and system based on artificial intelligence

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6941323B1 (en) * | 1999-08-09 | 2005-09-06 | Almen Laboratories, Inc. | System and method for image comparison and retrieval by enhancing, defining, and parameterizing objects in images
US7440586B2 (en) * | 2004-07-23 | 2008-10-21 | Mitsubishi Electric Research Laboratories, Inc. | Object classification using image segmentation
US7840076B2 (en) * | 2006-11-22 | 2010-11-23 | Intel Corporation | Methods and apparatus for retrieving images from a large collection of images
CN101965576B (en) * | 2008-03-03 | 2013-03-06 | 视频监控公司 | Object matching for tracking, indexing, and search
US8489627B1 (en) * | 2008-08-28 | 2013-07-16 | Adobe Systems Incorporated | Combined semantic description and visual attribute search
US9639780B2 (en) * | 2008-12-22 | 2017-05-02 | Excalibur Ip, Llc | System and method for improved classification
SG178829A1 (en) * | 2009-09-16 | 2012-04-27 | Univ Nanyang Tech | Textual query based multimedia retrieval system
US8452763B1 (en) * | 2009-11-19 | 2013-05-28 | Google Inc. | Extracting and scoring class-instance pairs
US8744172B2 (en) * | 2011-06-15 | 2014-06-03 | Siemens Aktiengesellschaft | Image processing using random forest classifiers
US9075825B2 (en) * | 2011-09-26 | 2015-07-07 | The University Of Kansas | System and methods of integrating visual features with textual features for image searching
US9224071B2 (en) * | 2012-11-19 | 2015-12-29 | Microsoft Technology Licensing, Llc | Unsupervised object class discovery via bottom up multiple class learning

Also Published As

Publication number | Publication date
US10176364B2 (en) | 2019-01-08
US20170091530A1 (en) | 2017-03-30
US20180300535A1 (en) | 2018-10-18
US10007838B2 (en) | 2018-06-26
US9519659B2 (en) | 2016-12-13
US20140267219A1 (en) | 2014-09-18

Similar Documents

Publication | Publication Date | Title
IN2013CH01043A (en) | |
MX2015001647A (en) | | Method and system for tagging information about image, apparatus and computer-readable recording medium thereof.
MX2017005802A (en) | | Media presentation modification using audio segment marking.
MX2016011979A (en) | | Media clip creation and distribution systems, apparatus, and methods.
BR112017011412A2 (en) | | data processing method, apparatus and system
MX2011013584A (en) | | Method and apparatus for modifying the presentation of content.
EP4236332A3 (en) | | Techniques and apparatus for editing video
PH12016500350B1 (en) | | Image processing apparatus and image processing method
MX2016001687A (en) | | Systems and methods for image classification by correlating contextual cues with images.
BR112016023510A8 (en) | | method for acquiring seismic data, recording system and method for generating a geophysical data product
WO2014140814A3 (en) | | Proof of presence via tag interactions
MX2016014071A (en) | | Method and apparatus for analyzing media content.
WO2014197360A3 (en) | | Method and system for providing sign data and sign history
MX2015012793A (en) | | Language learning environment.
WO2014085832A3 (en) | | Event investigation within an online research system
WO2013166140A3 (en) | | Playlist generation
WO2014102548A3 (en) | | Search system and corresponding method
WO2015160415A3 (en) | | Systems and methods for visual sentiment analysis
FR2990020B1 (en) | | CAPACITIVE DETECTION DEVICE WITH ARRANGEMENT OF CONNECTION TRACKS, AND METHOD USING SUCH A DEVICE.
MX352448B (en) | | Auto-adjusting content size rendered on a display.
GB2530463A (en) | | Identifying and extracting stratigraphic layers in one or more bodies representing a geological structure
GB201600107D0 (en) | | System and method for labeling messages from customer-agent interactions on social media to identify an issue and a response
WO2013088637A3 (en) | | Information processing device, information processing method and program
MX387045B (en) | | DETECTION OF COMMON MEDIA SEGMENTS.
MY183016A (en) | | Monitoring control system and control method