[go: up one dir, main page]

SG11202100469RA - Mapping object instances using video data - Google Patents

Mapping object instances using video data

Info

Publication number
SG11202100469RA
SG11202100469RA SG11202100469RA SG11202100469RA SG11202100469RA SG 11202100469R A SG11202100469R A SG 11202100469RA SG 11202100469R A SG11202100469R A SG 11202100469RA SG 11202100469R A SG11202100469R A SG 11202100469RA SG 11202100469R A SG11202100469R A SG 11202100469RA
Authority
SG
Singapore
Prior art keywords
video data
object instances
mapping object
mapping
instances
Prior art date
Application number
SG11202100469RA
Inventor
John Brendan Mccormac
Ronald Clark
Michael Bloesch
Andrew Davison
Stefan Leutenegger
Original Assignee
Imperial College Innovations Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Imperial College Innovations Ltd filed Critical Imperial College Innovations Ltd
Publication of SG11202100469RA publication Critical patent/SG11202100469RA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • G06T7/579Depth or shape recovery from multiple images from motion
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/02Control of position or course in two dimensions
    • G05D1/021Control of position or course in two dimensions specially adapted to land vehicles
    • G05D1/0231Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means
    • G05D1/0246Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means using a video camera in combination with image processing means
    • G05D1/0251Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means using a video camera in combination with image processing means extracting 3D information from a plurality of images taken from different locations, e.g. stereo vision
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/02Control of position or course in two dimensions
    • G05D1/021Control of position or course in two dimensions specially adapted to land vehicles
    • G05D1/0231Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means
    • G05D1/0246Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means using a video camera in combination with image processing means
    • G05D1/0253Control of position or course in two dimensions specially adapted to land vehicles using optical position detecting means using a video camera in combination with image processing means extracting relative motion information from a plurality of images taken successively, e.g. visual odometry, optical flow
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/20Control system inputs
    • G05D1/24Arrangements for determining position or orientation
    • G05D1/243Means capturing signals occurring naturally from the environment, e.g. ambient optical, acoustic, gravitational or magnetic signals
    • G05D1/2435Extracting 3D information
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05DSYSTEMS FOR CONTROLLING OR REGULATING NON-ELECTRIC VARIABLES
    • G05D1/00Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
    • G05D1/20Control system inputs
    • G05D1/24Arrangements for determining position or orientation
    • G05D1/243Means capturing signals occurring naturally from the environment, e.g. ambient optical, acoustic, gravitational or magnetic signals
    • G05D1/2437Extracting relative motion information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/162Segmentation; Edge detection involving graph-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/174Segmentation; Edge detection involving the use of two or more images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • G06T7/251Analysis of motion using feature-based methods, e.g. the tracking of corners or segments involving models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/30Determination of transform parameters for the alignment of images, i.e. image registration
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • G06T7/73Determining position or orientation of objects or cameras using feature-based methods
    • G06T7/75Determining position or orientation of objects or cameras using feature-based methods involving models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • G06V10/464Salient features, e.g. scale invariant feature transforms [SIFT] using a plurality of salient features, e.g. bag-of-words [BoW] representations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20016Hierarchical, coarse-to-fine, multiscale or multiresolution image processing; Pyramid transform
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20072Graph-based image processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20076Probabilistic image processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30244Camera pose

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Aviation & Aerospace Engineering (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Automation & Control Theory (AREA)
  • Electromagnetism (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)
SG11202100469RA 2018-08-13 2019-08-07 Mapping object instances using video data SG11202100469RA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB1813197.9A GB2576322B (en) 2018-08-13 2018-08-13 Mapping object instances using video data
PCT/GB2019/052215 WO2020035661A1 (en) 2018-08-13 2019-08-07 Mapping object instances using video data

Publications (1)

Publication Number Publication Date
SG11202100469RA true SG11202100469RA (en) 2021-02-25

Family

ID=63667239

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202100469RA SG11202100469RA (en) 2018-08-13 2019-08-07 Mapping object instances using video data

Country Status (9)

Country Link
US (1) US12062200B2 (en)
EP (1) EP3837667A1 (en)
JP (1) JP2021534495A (en)
KR (1) KR20210042942A (en)
CN (1) CN112602116A (en)
GB (1) GB2576322B (en)
SG (1) SG11202100469RA (en)
TW (1) TW202034215A (en)
WO (1) WO2020035661A1 (en)

Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2554633B (en) * 2016-06-24 2020-01-22 Imperial College Sci Tech & Medicine Detecting objects in video data
US11254002B1 (en) * 2018-03-19 2022-02-22 AI Incorporated Autonomous robotic device
WO2020023731A1 (en) * 2018-07-26 2020-01-30 Postmates Inc. Safe traversable area estimation in unstructure free-space using deep convolutional neural network
US11762394B2 (en) * 2018-11-01 2023-09-19 Nec Corporation Position detection apparatus, position detection system, remote control apparatus, remote control system, position detection method, and program
JP7167668B2 (en) * 2018-11-30 2022-11-09 コニカミノルタ株式会社 LEARNING METHOD, LEARNING DEVICE, PROGRAM AND RECORDING MEDIUM
GB2581808B (en) * 2019-02-26 2022-08-10 Imperial College Innovations Ltd Scene representation using image processing
CN112512755B (en) * 2019-03-01 2025-01-14 谷歌有限责任公司 Robotic manipulation using domain-invariant 3D representations predicted from 2.5D visual data
CN111666935B (en) * 2019-03-06 2024-05-24 北京京东乾石科技有限公司 Article center positioning method and device, logistics system and storage medium
CA3134424A1 (en) * 2019-03-18 2020-09-24 Geomagical Labs, Inc. Virtual interaction with three-dimensional indoor room imagery
CN110070056B (en) * 2019-04-25 2023-01-10 腾讯科技(深圳)有限公司 Image processing method, device, storage medium and equipment
US20220245829A1 (en) * 2019-05-27 2022-08-04 Nippon Telegraph And Telephone Corporation Movement status learning apparatus, movement status recognition apparatus, model learning method, movement status recognition method and program
US12051206B2 (en) * 2019-07-25 2024-07-30 Nvidia Corporation Deep neural network for segmentation of road scenes and animate object instances for autonomous driving applications
US11531088B2 (en) 2019-11-21 2022-12-20 Nvidia Corporation Deep neural network for detecting obstacle instances using radar sensors in autonomous machine applications
US12080078B2 (en) * 2019-11-15 2024-09-03 Nvidia Corporation Multi-view deep neural network for LiDAR perception
US11532168B2 (en) 2019-11-15 2022-12-20 Nvidia Corporation Multi-view deep neural network for LiDAR perception
US11885907B2 (en) 2019-11-21 2024-01-30 Nvidia Corporation Deep neural network for detecting obstacle instances using radar sensors in autonomous machine applications
US11410315B2 (en) * 2019-11-16 2022-08-09 Uatc, Llc High quality instance segmentation
US12050285B2 (en) 2019-11-21 2024-07-30 Nvidia Corporation Deep neural network for detecting obstacle instances using radar sensors in autonomous machine applications
US11536843B2 (en) * 2020-02-08 2022-12-27 The Boeing Company De-jitter of point cloud data for target recognition
WO2021171208A1 (en) * 2020-02-24 2021-09-02 Thales Canada Inc. Method for semantic object detection with knowledge graph
GB2593717B (en) * 2020-03-31 2022-08-24 Imperial College Innovations Ltd Image processing system and method
GB2593718B (en) 2020-03-31 2022-04-27 Imperial College Sci Tech & Medicine Image processing system and method
CN111709947B (en) * 2020-04-24 2024-04-02 浙江科技学院 Obvious object image detection method based on double-flow communication and global information guidance
JP2023538946A (en) 2020-08-25 2023-09-12 コモンウェルス サイエンティフィック アンド インダストリアル リサーチ オーガナイゼーション Multi-agent map generation
US11615544B2 (en) 2020-09-15 2023-03-28 Toyota Research Institute, Inc. Systems and methods for end-to-end map building from a video sequence using neural camera models
US11494927B2 (en) 2020-09-15 2022-11-08 Toyota Research Institute, Inc. Systems and methods for self-supervised depth estimation
US11508080B2 (en) 2020-09-15 2022-11-22 Toyota Research Institute, Inc. Systems and methods for generic visual odometry using learned features via neural camera models
US11321862B2 (en) 2020-09-15 2022-05-03 Toyota Research Institute, Inc. Systems and methods for multi-camera modeling with neural camera networks
KR102464130B1 (en) * 2020-09-17 2022-11-08 광주과학기술원 Apparatus and method identifying the size of the target object
US11657572B2 (en) 2020-10-21 2023-05-23 Argo AI, LLC Systems and methods for map generation based on ray-casting and semantic class images
US11633862B2 (en) 2020-11-25 2023-04-25 Metal Industries Research & Development Centre Automatic control method of mechanical arm and automatic control system
CN112581541B (en) * 2020-12-23 2025-01-14 苏州挚途科技有限公司 Parameter evaluation method, device and electronic device
US11501447B2 (en) * 2021-03-04 2022-11-15 Lemon Inc. Disentangled feature transforms for video object segmentation
CN113538576B (en) * 2021-05-28 2024-09-06 中国科学院自动化研究所 Grabbing method and device based on double-arm robot and double-arm robot
CN113223086B (en) * 2021-06-09 2022-05-03 司法鉴定科学研究院 Method and system for reconstructing vehicle running state suitable for low-quality monitoring video
CN113822134A (en) * 2021-07-19 2021-12-21 腾讯科技(深圳)有限公司 Instance tracking method, device, equipment and storage medium based on video
TWI782806B (en) * 2021-12-02 2022-11-01 財團法人國家實驗研究院 Point cloud rendering method
CN114187659A (en) * 2021-12-06 2022-03-15 河南牧原智能科技有限公司 Gesture recognition method and related products for recognizing pig gestures
US20230206625A1 (en) * 2021-12-23 2023-06-29 Here Global B.V. Method, apparatus, and system for pole extraction from optical imagery
CN114463232B (en) * 2021-12-27 2024-09-27 浙江大华技术股份有限公司 Image mapping method, electronic device and computer readable storage medium
CN114445376A (en) * 2022-01-27 2022-05-06 上海商汤智能科技有限公司 Image segmentation method and its model training method and related devices, equipment and media
CN115115652B (en) * 2022-05-09 2024-03-19 南京林业大学 On-line dividing method for street tree target point cloud
TWI830363B (en) * 2022-05-19 2024-01-21 鈺立微電子股份有限公司 Sensing device for providing three dimensional information
KR20240054780A (en) * 2022-10-19 2024-04-26 네이버랩스 주식회사 Method and system for correcting object pose
TWI817847B (en) * 2022-11-28 2023-10-01 國立成功大學 Method, computer program and computer readable medium for fast tracking and positioning objects in augmented reality and mixed reality

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4946535B2 (en) * 2007-03-12 2012-06-06 トヨタ自動車株式会社 Image recognition device
US10360718B2 (en) * 2015-08-14 2019-07-23 Samsung Electronics Co., Ltd. Method and apparatus for constructing three dimensional model of object
DE102016212695B4 (en) * 2016-05-31 2019-02-21 Siemens Aktiengesellschaft industrial robots
US10109055B2 (en) * 2016-11-21 2018-10-23 Seiko Epson Corporation Multiple hypotheses segmentation-guided 3D object detection and pose estimation
CN111133447B (en) * 2018-02-18 2024-03-19 辉达公司 Method and system for object detection and detection confidence for autonomous driving

Also Published As

Publication number Publication date
US12062200B2 (en) 2024-08-13
US20210166426A1 (en) 2021-06-03
CN112602116A (en) 2021-04-02
TW202034215A (en) 2020-09-16
KR20210042942A (en) 2021-04-20
GB2576322B (en) 2022-11-09
GB201813197D0 (en) 2018-09-26
JP2021534495A (en) 2021-12-09
EP3837667A1 (en) 2021-06-23
WO2020035661A1 (en) 2020-02-20
GB2576322A (en) 2020-02-19

Similar Documents

Publication Publication Date Title
SG11202100469RA (en) Mapping object instances using video data
GB2575436B (en) Guaranteed data compression
GB201804082D0 (en) Image annotation
CA187900S (en) Video encoder
GB2575437B (en) Guaranteed data compression
GB2575121B (en) Guaranteed data compression
GB201814068D0 (en) Transmittinf data
GB2568492B (en) Image data interpolation
GB201810793D0 (en) Guaranteed data compression
EP3850926C0 (en) Data centre
GB2575435B (en) Guaranteed data compression
GB2575434B (en) Guaranteed data compression
GB2577104B (en) Viewing data
EP3703061A4 (en) Image retrieval
GB2567149B (en) Managing data Compression
EP3673654A4 (en) Video data encoding
GB2587594B (en) Position data pseudonymization
GB2578421B (en) Data compression
GB2581014B (en) Sensor data management
GB2605094B (en) Guaranteed data compression
GB2585513B (en) Data compression
IL271662A (en) Context data management
GB201820951D0 (en) Application analytics
GB202012406D0 (en) Data compression
GB201818089D0 (en) Video media enhancement - regression processor