Study Notes: Autonomous Vehicles and Machine Learning
Overview of Autonomous Driving
● Key Challenge: Developing a driving agent that can interpret sensor data and make
safe, real-time decisions.
● Core Components:
○ Sensors: Camera, lidar, radar.
○ Driving agent: Software to process inputs and control the vehicle (steering,
acceleration, braking).
● Key Goals:
○ Ensure safety and robustness in a variety of environments.
○ Maintain near-zero tolerance for errors.
○ Operate within real-time latency constraints (decision cycles typically on the order of 10 Hz).
Phases of Development
1. Phase 1 (Proof of Concept):
○ Demonstrated autonomous driving feasibility.
○ First fully autonomous drive in 2015 using a prototype vehicle (Firefly).
2. Phase 2 (Scaling Deployment):
○ 2020: Launch of Waymo One service for paying customers.
○ Current coverage:
■ 225 square miles in Phoenix Metro.
■ Entire city of San Francisco.
○ Expansion underway in Los Angeles and Austin, TX.
Capabilities and Features
● Sensor Fusion: Combines data from cameras, lidar, and radar for a comprehensive
view of the environment.
● Adaptability: Handles diverse conditions such as:
○ Urban, suburban, and freeway settings.
○ Varied weather and lighting conditions (e.g., rain, tunnels).
● Obstacle Interaction: Navigates complex environments, including:
○ Construction zones and detours.
○ Interactions with pedestrians, cyclists, and other road users.
Machine Learning Applications in Autonomous Vehicles
1. Data Processing:
○ Extract relevant features from a high-dimensional input space.
○ Handle tens of millions of sensor readings per second.
2. Context Understanding:
○ Recognize gestures, road signs, and traffic signals.
○ Interpret complex scenarios like lane merges, narrow roads, and
double-parked vehicles.
3. Rare Event Handling:
○ Identify and react to unexpected scenarios (e.g., objects falling from vehicles,
unconventional road users).
Accessibility Features
● For Blind or Visually Impaired Users:
○ Screen reader and high-contrast support in the app.
○ Audio tools providing context about the vehicle's actions.
● For Deaf Users:
○ Chat-based rider support in place of voice calls.
● For Wheelchair Users:
○ Partnerships with accessible transportation providers.
○ Mapping and annotation of accessibility features in high-definition 3D models.
Autonomous Driving Data Models
● High-Definition Maps:
○ Create semantic 3D models using sensor data.
○ Map curbs, sidewalks, driveways, and other accessibility-related features.
○ Enable centimeter-level resolution for accurate navigation.
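To make the map content above concrete, here is a minimal sketch of how a single semantic map feature might be represented. The class name, fields, and feature taxonomy are illustrative assumptions for these notes, not Waymo's actual map schema.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass
class MapFeature:
    """Toy semantic map feature (illustrative only; not Waymo's map format)."""
    feature_type: str                            # e.g., "curb", "sidewalk", "driveway"
    polyline: List[Tuple[float, float, float]]   # 3D vertices in meters (cm-level precision)
    accessible: bool = False                     # e.g., curb cut usable for wheelchair boarding

# A short curb segment ending in a curb cut, annotated as accessible.
curb = MapFeature(
    feature_type="curb",
    polyline=[(12.34, 5.10, 0.15), (12.84, 5.11, 0.15), (13.34, 5.12, 0.02)],
    accessible=True,
)
print(curb.feature_type, len(curb.polyline), "vertices")
```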
Future Prospects
● Equity in Mobility:
○ Expand access to underserved and mobility-challenged populations.
○ Provide independence and reduce reliance on human-driven transport.
● Technology Enhancement:
○ Continuous improvement in handling rare events and extreme scenarios.
○ Partner with cities for accessibility improvements and infrastructure upgrades.
These notes consolidate essential information on autonomous vehicles, focusing on their
functionality, development phases, and machine learning applications.
Perception, Planning, and Machine Learning in Autonomous Driving
1. Core Objectives:
○ Use machine learning (ML) to enhance perception, planning, and overall
driving performance.
○ Aim to maximize data utility to create robust, safe, and performant driving
agents.
2. Testing and Validation Challenges:
○ Real-world driving for validation is limited by scale and safety risks,
especially for challenging maneuvers.
○ Solution: Use simulators for scalable system validation.
Simulators for Autonomous Vehicles
1. Capabilities Required:
○ Replay real-world scenarios to test agent behavior.
○ Generate new, unseen scenarios to explore corner cases.
○ Model multi-agent interactions (e.g., vehicles, pedestrians) dynamically.
2. Development Goals:
○ Build simulators from collected data.
○ Leverage ML to generalize and automate simulator design.
Autonomous Vehicle Architectures
1. Sensor Input and Perception Module:
○ Aggregate data from multiple sensors (e.g., lidar, cameras) into a bird's-eye
view (BEV) representation.
○ Construct a coherent 3D model of the environment for better generalization.
○ Intermediate outputs:
■ 3D objects and attributes.
■ Probabilistic occupancy grids for objects without well-defined shapes (see the sketch below).
○ Advantages:
■ Improves generalization and simplifies analysis and validation.
■ Compresses high-dimensional sensor data for efficient use.
2. Feature Representation Trade-offs:
○ Pros: Facilitates data compression and simulation development.
○ Cons: Requires careful feature selection and labeling.
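As a rough illustration of the occupancy-grid idea mentioned above, the sketch below rasterizes lidar points into a bird's-eye-view grid. The grid size, resolution, and the simple count-to-probability mapping are assumptions made for this example; a production perception stack is far more sophisticated.

```python
import numpy as np

def lidar_to_bev_occupancy(points, grid_size=200, resolution=0.5):
    """Rasterize lidar points into a bird's-eye-view (BEV) occupancy grid.

    points: (N, 3) array of x, y, z coordinates in the vehicle frame (meters).
    grid_size: number of cells per side of the square BEV grid.
    resolution: cell edge length in meters.
    Returns a (grid_size, grid_size) array of occupancy values in [0, 1].
    """
    half_extent = grid_size * resolution / 2.0
    # Keep only points that fall inside the grid footprint around the vehicle.
    in_range = (np.abs(points[:, 0]) < half_extent) & (np.abs(points[:, 1]) < half_extent)
    xy = points[in_range, :2]

    # Convert metric coordinates to integer cell indices.
    cells = np.floor((xy + half_extent) / resolution).astype(int)
    cells = np.clip(cells, 0, grid_size - 1)

    # Count lidar returns per cell, then squash counts into a pseudo-probability.
    counts = np.zeros((grid_size, grid_size), dtype=np.float32)
    np.add.at(counts, (cells[:, 0], cells[:, 1]), 1.0)
    occupancy = 1.0 - np.exp(-counts / 4.0)  # more hits -> value closer to 1
    return occupancy

# Example: 1,000 random points within ~50 m of the vehicle.
points = np.random.uniform(-50, 50, size=(1000, 3))
grid = lidar_to_bev_occupancy(points)
print(grid.shape, grid.max())
```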
Behavior Modeling in Autonomous Driving
1. Trajectory Prediction:
○ Predict multiple potential trajectories for each observed agent (e.g., vehicles,
pedestrians).
○ Outputs: Probabilistic trajectories with uncertainties over time.
○ Example: Gaussian mixture models over future trajectories capture non-deterministic behavior (see the sketch below).
2. Interaction Modeling:
○ Conventional methods predict each agent's behavior independently, which can yield overlapping, physically inconsistent trajectories in multi-agent scenarios.
○ Improved models incorporate inter-agent relationships to ensure realistic,
collision-free predictions.
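The Gaussian-mixture idea from the trajectory-prediction notes can be made concrete with a small numerical sketch: each agent gets K candidate future trajectories with mixture weights and per-waypoint uncertainties, and the likelihood of an observed trajectory is evaluated under that mixture. All numbers and shapes here are toy assumptions, not a real model's output format.

```python
import numpy as np

# Toy mixture output for one agent: K candidate trajectories ("modes"),
# each a sequence of T future (x, y) means with per-step standard deviations.
K, T = 3, 8                                        # modes, future timesteps
mode_probs = np.array([0.6, 0.3, 0.1])             # mixture weights, sum to 1
means = np.cumsum(np.random.randn(K, T, 2) * 0.5 + np.array([1.0, 0.0]), axis=1)
stds = 0.2 + 0.1 * np.arange(1, T + 1)[None, :, None]  # uncertainty grows with horizon
stds = np.broadcast_to(stds, (K, T, 2))

def trajectory_log_likelihood(traj, mode_probs, means, stds):
    """Log-likelihood of an observed (T, 2) trajectory under the mixture,
    treating each waypoint as an independent diagonal Gaussian."""
    diff = traj[None] - means                                  # (K, T, 2)
    per_mode = -0.5 * np.sum((diff / stds) ** 2 + 2 * np.log(stds) + np.log(2 * np.pi),
                             axis=(1, 2))                      # (K,)
    # Log-sum-exp over modes, weighted by the mixture probabilities.
    scores = np.log(mode_probs) + per_mode
    return np.logaddexp.reduce(scores)

observed = means[0] + np.random.randn(T, 2) * 0.1
print("log-likelihood:", trajectory_log_likelihood(observed, mode_probs, means, stds))
```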
Machine Learning Architectures
1. Perception Models:
○ Use the SWFormer architecture for processing bird's-eye-view features.
■ Combines sparse lidar data with transformer-based processing.
■ Benefits: Handles large distances efficiently without quadratic scaling (illustrated below).
2. Behavior Models:
○ Transition from custom models to transformer-based architectures like
"Wayformer."
○ Benefits: Scalable, efficient, and capable of encoding complex scene
interactions.
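To illustrate why window-based transformers avoid quadratic scaling, here is a minimal single-head self-attention sketch restricted to fixed-size windows of BEV tokens. It captures only the core scaling idea; it is not the SWFormer or Wayformer architecture (which use learned projections, multi-scale windows, and much more).

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def windowed_self_attention(features, window=16):
    """Single-head self-attention restricted to non-overlapping windows.

    features: (N, D) sequence of BEV tokens (e.g., flattened occupied cells).
    Cost is O(N * window * D) instead of O(N^2 * D) for full attention.
    """
    N, D = features.shape
    pad = (-N) % window
    x = np.pad(features, ((0, pad), (0, 0)))          # pad so N divides evenly
    x = x.reshape(-1, window, D)                      # (num_windows, window, D)

    # Identity projections keep the sketch simple; a real model learns Q, K, V.
    q, k, v = x, x, x
    attn = softmax(q @ k.transpose(0, 2, 1) / np.sqrt(D), axis=-1)
    out = attn @ v                                    # (num_windows, window, D)
    return out.reshape(-1, D)[:N]

tokens = np.random.randn(1000, 32)                    # 1,000 occupied cells, 32-dim features
print(windowed_self_attention(tokens).shape)          # (1000, 32)
```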
Advanced Applications
1. Human Interaction Understanding:
○ Recognize and respond to gestures and signals (e.g., from police officers and construction workers).
○ Use pedestrian keypoint detection for gesture analysis.
2. Nighttime and Adverse Conditions:
○ Leverage lidar for enhanced perception in low-visibility conditions.
3. Multi-Agent Dynamics:
○ Simulate and predict realistic interactions between agents to avoid risky
behaviors.
These study notes provide an organized summary of key aspects of autonomous vehicle
development, focusing on perception, planning, and machine learning integration.
The talk explores various techniques and methodologies for training robust machine learning (ML) agents, particularly in the domain of autonomous driving. Below are the key takeaways:
1. PMM and Data Augmentation
○ The parameter P_0 (or P_{05}) is maintained consistently across all data augmentation operations.
○ A global grid search is performed to find the optimal PMM across all
augmentation strategies.
○ This approach significantly improves the performance of models like
SWFormer through better data augmentation and is especially beneficial for
rare examples.
2. Addressing Covariate Shift
○ Covariate shift remains a challenge in ML-driven agents, particularly when
policies learned through behavior cloning deviate from their training
distribution over time, leading to compounding errors.
○ A novel approach called BC-SAC (Behavior Cloning Soft Actor-Critic) was
proposed. It combines imitation learning for in-distribution scenarios and
reinforcement learning (RL) for out-of-distribution robustness.
○ The BC-SAC method showed the most robust performance, outperforming naive behavior cloning and other imitation-learning techniques such as MGAIL (a sketch of the combined objective appears after this list).
3. Vision-Language and Large Language Models (VLM/LLM)
○ These models, trained on vast internet-scale datasets, exhibit a deep
understanding of visual and textual information.
○ They can interpret complex scenarios like understanding parking rules from a
sign or analyzing a flipped car scene to provide actionable advice.
○ This knowledge is being integrated into ML agents to enhance their reasoning
and decision-making capabilities.
4. Leveraging Vision-Language Models for Training
○ A method was introduced to annotate LiDAR data using embeddings derived
from vision-language models.
○ By filtering out static objects and focusing on moving agents, this method
enables the training of object detectors without manual labeling.
○ The annotations are used to train detectors that predict bounding boxes and semantic labels derived from scene information.
5. Generative AI for Scenario Simulation
○ Diffusion models are being used to create realistic scene variations and
generate plausible motion for traffic agents.
○ By incorporating language-based constraints, large language models can define complex scenarios, for example having agents arrange themselves in a particular formation around the autonomous vehicle.
6. Future Directions and Open Questions
○ How to reconcile traditional scene representations (lanes, boxes, etc.) with
language-based descriptions.
○ Optimizing language-based scene understanding for onboard computational
constraints.
○ Exploring language-conditioned planners to refine scenarios or instructions
dynamically.
7. Call for Collaboration and Research
○ The speaker emphasized the exciting challenges and applications in the
domain, encouraging more researchers to contribute.
○ Data sets and resources are available on the Waymo Research page for
those interested in diving deeper.
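For the BC-SAC takeaway above, the following sketch shows the general shape of an actor objective that mixes an imitation (behavior cloning) term with a soft actor-critic term. The weighting scheme and variable names are assumptions made for illustration; they do not reproduce the exact formulation of the BC-SAC work.

```python
import numpy as np

def bc_sac_actor_loss(log_prob_expert, q_values, log_prob_sampled,
                      alpha=0.2, bc_weight=1.0):
    """Toy combined actor objective in the spirit of BC-SAC.

    log_prob_expert : log pi(a_expert | s) for logged expert actions (imitation term).
    q_values        : critic estimates Q(s, a) for actions sampled from the policy.
    log_prob_sampled: log pi(a | s) for those sampled actions (entropy term).
    alpha           : SAC entropy temperature.
    bc_weight       : imitation-vs-RL trade-off (an assumed knob, not the
                      paper's exact formulation).
    """
    bc_loss = -np.mean(log_prob_expert)                      # behavior cloning (MLE)
    sac_loss = np.mean(alpha * log_prob_sampled - q_values)  # soft actor-critic term
    return bc_weight * bc_loss + sac_loss

# Toy batch of 4 states.
print(bc_sac_actor_loss(log_prob_expert=np.array([-1.2, -0.8, -2.0, -1.5]),
                        q_values=np.array([0.5, 0.7, 0.2, 0.9]),
                        log_prob_sampled=np.array([-1.0, -0.9, -1.8, -1.1])))
```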
This comprehensive overview highlights advancements and open questions in building
scalable, robust, and intelligent ML agents for autonomous driving.