MongoDB Best Practices - Schema Design, Indexes

The document outlines best practices for optimizing MongoDB, emphasizing the importance of schema design, data embedding, and indexing for performance. It highlights that MongoDB's schema should be tailored to application needs rather than traditional relational database structures. Additionally, it discusses server sizing, replication, and sharding as strategies for managing performance and redundancy in larger databases.

Uploaded by

aryamoneycontrol

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views4 pages

MongoDB Best Practices - Schema Design, Indexes

Uploaded by

aryamoneycontrol

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

MongoDB Best Practices: Schema

Design, Indexes & More

Sync, Store, Access and Visualize your Data

No-code ELT
Cloud Data Warehouse
Unlimited Connectors

Book a Demo
By Dawid Ziolkowski | August 2, 2022 | Updated On: September 15, 2023 | Data
Management
MongoDB database is really popular these days. Developers often use it instead of
MySQL, but these two platforms aren’t in direct competition. While MySQL is a
relational database, MongoDB is a NoSQL document-oriented database, so the two
work quite differently. And for that reason, optimizing MongoDB is not the same as
optimizing a traditional relational database, although some best practices are similar.
Read on to learn what to do and what to avoid when using MongoDB.

Understand Schema Differences Between

Relational and Document-based Databases
Let's start with the most important difference: the schema. Designing your database
schema is a crucial task, and while making changes is possible and common, it can be
expensive from an engineering perspective. When a database schema needs changes,
your deployment process becomes much more complicated, so good design is critical.
How do you design a good MongoDB database schema? Rule number one: don't
design it as you would with relational databases. It sounds logical to split your schema
into small table-like pieces, right?

In the case of MongoD, no. For relational databases, you usually construct a schema
based on the data. You need to figure out how to split the data your application will use
into tables so it’s logically organized and not duplicated. But when it comes to a
MongoDB schema, you should look not at the data itself, but at the application.
Specifically, how your application will use the data, what kind of queries it will likely
execute, and so on. This means that two different applications using the exact same
data might have very different schema designs in MongoDB, whereas for relational
databases the schema would probably be the same or very similar across applications.

Another thing you need to know is that MongoDB has almost no rules or guidelines on
how you should structure the data, because MongoDB operates on JSON-like
documents. This gives you the ability to embed data into arrays and objects within one
document. If you want to learn more about modelling data, take a look at this free
course from MongoDB.

Embed Your Data Instead of Relying on Joins

One of the best practices when using MongoDB is to embed your data within one
document instead of performing lookups or creating in-application joins. It may be a bit
counterintuitive, but MongoDB performs better when you stuff all the data you need into
one document. For example, instead of putting user details in one document and user
order history in another, chuck them into the same one. Reading documents is
extremely fast in MongoDB. Performing lookups or joins within the application is slower
in most cases.

Keep in mind that this is only a general rule and you should always start by
understanding your application query pattern. Including the data in the document is
preferred over lookup operations. But, of course, there is no point in dumping all
possible data in one document.

Use Indexes For Frequent Operations

Let's talk about indexes. This next MongoDB best practice is similar to what you'd do
with relational databases. In the previous best practice we mentioned that MongoDB
prefers to embed data (instead of splitting it into smaller logical pieces). Therefore it's
normal for MongoDB documents to become quite big. This will naturally impact
performance, but indexes can solve that. Indexes in MongoDB work pretty much the
same way as with relational databases. These special data structures store a small
subset of the whole document in order to speed up the matching of data for frequently
used queries.

For example, imagine that you have your user's data together with their order history in
a single document and you want to find all users who ordered something in the last
month. Normally (without indexes) MongoDB would have to scan the whole user
collection, going one by one through the user document and checking the last order
data for each user. It's not horrible; that's how the database performs a lot of operations.
But if you frequently ask the database for this kind of matching, then indexing will help
you a lot. Coming back to our example - with indexes, MongoDB stores a separate,
small list containing pointers to the data (for example user id, email address, or last
order date).

Properly Size Your Servers

It may sound obvious, but server RAM sizing in MongoDB is crucial. There are two
things to keep in mind: first, more memory won't increase the performance of your
database. It's not just a matter of getting the server with the most RAM memory you can
afford. Second, MongoDB performs best when its working set can fit an application's
RAM.

Get your guide to Modern Data Management

Download Now
Sizing your MongoDB machine is not dependent on the size of the database itself. It
doesn't matter if you have 100MB or 2TB of data in your MongoDB instance. What
matters is the size of indices and frequently accessed data. To size your MongoDB
instance, you need to perform some tests to find out how much data your application
normally uses. Then, make sure to use a server with slightly more memory than that. If
your working set won't fit in the RAM, MongoDB will read the data from disk. And even if
you use superfast SSD disks, the operation will be much slower than reading from
RAM.

So, how do you know if your MongoDB working set fits in your RAM? The simplest way
is to execute MongoDB's serverStatus command. From there, take a look at the pages-
read-into-cache and unmodified-pages-evicted metrics. If you see high numbers in
these two, it most likely means that your working set does not fit in your RAM memory.
Use Replication or Sharding
As with relational databases, another MongoDB best practice is to
use replication and/or sharding when your database becomes slow. MongoDB
implements replication by use of replica sets, and works similarly to other database
systems using primary and secondary nodes. You can instruct your application to run
some queries on secondary servers (or use load balancers), relieving some pressure
on your primary server.

What's good about MongoDB replication is that it also serves as a great redundancy
mechanism. Since it simply copies documents from primary to secondary nodes,
electing one of the secondary nodes to be a primary in case your original primary server
fails is simple. You won't run into any inconsistencies or complicated election processes
with MongoDB. Therefore, replicating your MongoDB is good not only for better
performance, but for redundancy.

Replication helps the most for small and medium databases, so once your dataset gets
really big, consider sharding. Although replication just copies all the data across multiple
servers, sharding actually splits the data into smaller pieces and distributes them across
servers. This brings great performance improvement for large data sets and allows you
to horizontally scale both reads and writes. You can read more about how it works here.

Summary
As you can see, MongoDB best practices are a mix of typical database best practices
and some specific to MongoDB. The nice thing about MongoDB is that you don’t need
to start worrying about performance until you have a relatively big database - it’s fast
and optimized by design. This doesn't mean you should ignore best practices when
working with smaller databases. Some of the best practices we mentioned aren’t just for
boosting performance, but can ensure good database design. They should always be
top of mind no matter the size of the database.

If you want to learn more about the differences between SQL and NoSQL databases,
take a look at our blog post here.

Data Modeling With Mongodb
No ratings yet
Data Modeling With Mongodb
22 pages
Atlas Best Practices
No ratings yet
Atlas Best Practices
19 pages
Prefer Embedding: Document Schema Design Cheatsheet
No ratings yet
Prefer Embedding: Document Schema Design Cheatsheet
1 page
MongoDB for Developers & DBAs
No ratings yet
MongoDB for Developers & DBAs
7 pages
FSD Unit III
No ratings yet
FSD Unit III
22 pages
MongoDB Features for Developers
100% (1)
MongoDB Features for Developers
5 pages
Advanced Developer Student Workbook
No ratings yet
Advanced Developer Student Workbook
30 pages
Mongodb Tips For Better Performance Blog Post
No ratings yet
Mongodb Tips For Better Performance Blog Post
4 pages
MongoDB Data Modeling - Sample Chapter
No ratings yet
MongoDB Data Modeling - Sample Chapter
40 pages
Mongodb Tutorial: Database Collection
No ratings yet
Mongodb Tutorial: Database Collection
36 pages
MongoDB Guide for Adobe Developers
No ratings yet
MongoDB Guide for Adobe Developers
7 pages
RDBMS To MongoDB Migration
No ratings yet
RDBMS To MongoDB Migration
20 pages
Ultimate Mongodb Cheatsheet
No ratings yet
Ultimate Mongodb Cheatsheet
5 pages
Mongodb Introduction: Presenter: John Page
No ratings yet
Mongodb Introduction: Presenter: John Page
63 pages
RDBMS To MongoDB Migration
No ratings yet
RDBMS To MongoDB Migration
19 pages
Mongodb Cheat Sheet
No ratings yet
Mongodb Cheat Sheet
10 pages
281511lecture Notes 2 - MongoDB Data Modeling-1718181255820
No ratings yet
281511lecture Notes 2 - MongoDB Data Modeling-1718181255820
13 pages
Module 3
No ratings yet
Module 3
22 pages
Unit 2 (MongoDB)
No ratings yet
Unit 2 (MongoDB)
17 pages
MongoDB: NoSQL Database Guide
No ratings yet
MongoDB: NoSQL Database Guide
6 pages
Mongo Lesson2
No ratings yet
Mongo Lesson2
43 pages
Mongo DB
No ratings yet
Mongo DB
99 pages
MEAN Ebook - CodeWithRandom
No ratings yet
MEAN Ebook - CodeWithRandom
524 pages
MongoDB vs RDBMS: Performance Insights
No ratings yet
MongoDB vs RDBMS: Performance Insights
12 pages
MongoDB Performance Best Practices
No ratings yet
MongoDB Performance Best Practices
15 pages
Mongodb
No ratings yet
Mongodb
28 pages
Mongo DB
No ratings yet
Mongo DB
3 pages
1664473609-Unit 5 - Database Management - MongoDB
No ratings yet
1664473609-Unit 5 - Database Management - MongoDB
23 pages
MongoDB Atlas Best Practices White Paper PDF
No ratings yet
MongoDB Atlas Best Practices White Paper PDF
21 pages
MST Unit 5
No ratings yet
MST Unit 5
6 pages
MongoDB Performance Best Practices
No ratings yet
MongoDB Performance Best Practices
15 pages
Complete Unit 3 Notes
No ratings yet
Complete Unit 3 Notes
30 pages
MST Unit-5
No ratings yet
MST Unit-5
14 pages
MongoDB for Developers
No ratings yet
MongoDB for Developers
17 pages
MongoDB: A Guide for Developers
No ratings yet
MongoDB: A Guide for Developers
50 pages
281510lecture - 1 Introduction To MongoDB-1718181125331
No ratings yet
281510lecture - 1 Introduction To MongoDB-1718181125331
22 pages
Mongodb
No ratings yet
Mongodb
60 pages
Homework 4.4 Mongodb
No ratings yet
Homework 4.4 Mongodb
6 pages
RDMS To mongoDB Migration
No ratings yet
RDMS To mongoDB Migration
18 pages
Mongo DB
No ratings yet
Mongo DB
8 pages
Beginner'S Guide To Concepts of Nosql and Mongodb: Documented By: - Maulin Shah
No ratings yet
Beginner'S Guide To Concepts of Nosql and Mongodb: Documented By: - Maulin Shah
5 pages
MongoDb
No ratings yet
MongoDb
15 pages
Screenshot 2024-09-21 at 8.36.35 AM
No ratings yet
Screenshot 2024-09-21 at 8.36.35 AM
31 pages
MongoDB Essentials for Developers
No ratings yet
MongoDB Essentials for Developers
144 pages
Migration of Relational Database To Mongodb
No ratings yet
Migration of Relational Database To Mongodb
7 pages
LU 3 and LU 4 NoSQL
No ratings yet
LU 3 and LU 4 NoSQL
36 pages
Mongodb
No ratings yet
Mongodb
9 pages
Module 5 Indexes
No ratings yet
Module 5 Indexes
4 pages
05 Chapter Performance MongoDB
No ratings yet
05 Chapter Performance MongoDB
42 pages
MongoDB Case Study 1
No ratings yet
MongoDB Case Study 1
6 pages
Rdbms To Mongodb Migration Guide: Considerations and Best Practices June 2015
No ratings yet
Rdbms To Mongodb Migration Guide: Considerations and Best Practices June 2015
17 pages
MongoDB - Course Curriculum
No ratings yet
MongoDB - Course Curriculum
5 pages
MongoDB Cheat Sheet
No ratings yet
MongoDB Cheat Sheet
17 pages
Mongodb DBA Homework 4.3 Answer
100% (1)
Mongodb DBA Homework 4.3 Answer
6 pages
MongoDB Essentials 2025
No ratings yet
MongoDB Essentials 2025
106 pages
Rdbms Bus Booking Management System
No ratings yet
Rdbms Bus Booking Management System
17 pages
HANA SQL SQLCache PinnedPlans 1.00.110+
No ratings yet
HANA SQL SQLCache PinnedPlans 1.00.110+
3 pages
Set 2
No ratings yet
Set 2
9 pages
Chap 9
No ratings yet
Chap 9
11 pages
DBMS Mini Project Movies Database GARVIT
No ratings yet
DBMS Mini Project Movies Database GARVIT
12 pages
SQL NULL Replacement Techniques
No ratings yet
SQL NULL Replacement Techniques
4 pages
CT113H Lecture 1 - Introduction To NoSQL
No ratings yet
CT113H Lecture 1 - Introduction To NoSQL
51 pages
PostgreSQL Architecture Deep-Dive - Brijesh Mehra
No ratings yet
PostgreSQL Architecture Deep-Dive - Brijesh Mehra
75 pages
PAM For Informatica Platform v10.4
No ratings yet
PAM For Informatica Platform v10.4
204 pages
MYSQL Booklet Answer Key (2022-23)
No ratings yet
MYSQL Booklet Answer Key (2022-23)
6 pages
Unit 1 Dbms - Patel
No ratings yet
Unit 1 Dbms - Patel
183 pages
DBMS Essentials for IT Professionals
No ratings yet
DBMS Essentials for IT Professionals
73 pages
Create Oracle Database on Linux
No ratings yet
Create Oracle Database on Linux
3 pages
Practical No. 1: Aim: Study About Distributed Database System. Theory
No ratings yet
Practical No. 1: Aim: Study About Distributed Database System. Theory
22 pages
CH - 5 Fundamentals of A Database System
No ratings yet
CH - 5 Fundamentals of A Database System
13 pages
Power BI & SQL Mastery Program
No ratings yet
Power BI & SQL Mastery Program
24 pages
Job Scheduling System
No ratings yet
Job Scheduling System
19 pages
Assignment DBMS: Sir Faizan Ahmad
No ratings yet
Assignment DBMS: Sir Faizan Ahmad
49 pages
Chapter 3: Introduction To SQL
No ratings yet
Chapter 3: Introduction To SQL
37 pages
Spring 2025 - CS405 - 1
No ratings yet
Spring 2025 - CS405 - 1
3 pages
System Data Dll-Resources Dat
No ratings yet
System Data Dll-Resources Dat
63 pages
Database Management System
50% (2)
Database Management System
244 pages
Mongodb - Quick Guide Mongodb Overview
No ratings yet
Mongodb - Quick Guide Mongodb Overview
18 pages
Recently Asked Data Analyst Interview Questions
No ratings yet
Recently Asked Data Analyst Interview Questions
4 pages
SQL Guide
No ratings yet
SQL Guide
19 pages
PI OLEDB and DTS SSIS
No ratings yet
PI OLEDB and DTS SSIS
24 pages
ITEC 4010: Systems Analysis and Design II: Mapping UML Object Models
No ratings yet
ITEC 4010: Systems Analysis and Design II: Mapping UML Object Models
39 pages
DBMS Lab Programs 4th Sem
100% (1)
DBMS Lab Programs 4th Sem
36 pages
SQL Transformation in Informatica With Examples
No ratings yet
SQL Transformation in Informatica With Examples
10 pages
SQL Interview Questions
100% (1)
SQL Interview Questions
25 pages

MongoDB Best Practices - Schema Design, Indexes

Uploaded by

MongoDB Best Practices - Schema Design, Indexes

Uploaded by

MongoDB Best Practices: Schema

Design, Indexes & More

Sync, Store, Access and Visualize your Data

Understand Schema Differences Between

Embed Your Data Instead of Relying on Joins

Use Indexes For Frequent Operations

Properly Size Your Servers

Get your guide to Modern Data Management

You might also like