Building Graph Applications with Neo4j

Neo4j Manchester Meetup

Wednesday 1st March
Contents
● Introduction
● The Graph
○ Introduction to the demo
○ Neo4j Primer
○ The graph database
○ Recommendations
● Building a front end around Neo4j
○ Motivations
○ Data management
○ Live demo
Metafused
● Use AI to optimise existing and build net new
applications for a broad set of verticals
○ AI is applied to reduce friction, optimise processes, execute
action(s) based on observed trigger(s)
● Data driven
○ Being comfortable with being uncomfortable
○ Using the right tool(s) for the job
○ Build, measure, learn
Our Team
Seven Bridges of Königsberg
‘Devise a walk through the city that would cross each of those bridges once and only once.’

Leonhard Euler (1736) proved that, with some problems, you can’t solve them doing the same thing you did yesterday … and expecting different results.
Thinking Differently
Making the Case for the
Bottom Up Approach

‘You can’t do much carpentry with bare hands. Neither can you do much thinking with a bare brain.’ - Daniel Dennett
Or, if you prefer not to have to read ...

But you will have to watch Brad Pitt.

Sorry.
Backend Design
Building Applications with Neo4j
● Presentation of a prototype application based on a well
known problem
○ Demonstrate capability of technology stack
○ Show how Metafused are using Neo4j as a key component of our
architecture
○ Demonstrate how Neo4j can be used as part of an AI system
● Building a live recommendation system
Neo4j Primer
● Nodes
○ Objects representing entities
● Labels
○ Assigned to nodes to specify the type of entity
● Relationships
○ Directional connections between nodes
● Properties
○ Additional information which can be attached to nodes and
relationships
● Cypher
○ Declarative query language for Neo4j
● Patterns
○ Selected combinations of nodes and relationships
● Let’s explore these concepts along with the graph...
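As a rough illustration of the primer above, the sketch below models nodes, labels, relationships and properties as plain Python objects and matches one simple pattern. It is not how Neo4j stores or queries data; the class names, the `match` helper and the sample movies are assumptions made for this example only.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    labels: set                                      # e.g. {"Person"}
    properties: dict = field(default_factory=dict)   # e.g. {"name": ...}


@dataclass
class Relationship:
    start: Node                                      # relationships are directional
    type: str                                        # e.g. "DIRECTED"
    end: Node
    properties: dict = field(default_factory=dict)


# Illustrative data only, not taken from the demo graph.
stanton = Node({"Person"}, {"name": "Andrew Stanton"})
nemo = Node({"Movie"}, {"title": "Finding Nemo"})
wall_e = Node({"Movie"}, {"title": "WALL-E"})
relationships = [
    Relationship(stanton, "DIRECTED", nemo),
    Relationship(stanton, "DIRECTED", wall_e),
    Relationship(stanton, "WROTE", nemo),
]


def match(rels, start_label, rel_type, end_label):
    """Return relationships fitting (:start_label)-[:rel_type]->(:end_label)."""
    return [
        r for r in rels
        if start_label in r.start.labels
        and r.type == rel_type
        and end_label in r.end.labels
    ]


directed = match(relationships, "Person", "DIRECTED", "Movie")
titles = [r.end.properties["title"] for r in directed]
print(titles)
```

In Cypher the same pattern is written declaratively, and the database decides how to find the matches.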
A Known Graph
To demonstrate the technology, start simple. A great use case for Neo4j is the Movie graph
seen in many of the training examples.
A Simple Query (1)
Start with a simple Cypher query: find a node with the label Person and a name property equal to
“Andrew Stanton”.

MATCH (p :Person) WHERE p.name = "Andrew Stanton" RETURN p;

A Simple Query (2)
We can look at specific relationships to answer simple questions. For example, which movies
has Andrew Stanton had a relationship with or, more specifically, directed?

MATCH (p :Person)-[r]->(m :Movie) WHERE p.name = "Andrew Stanton" RETURN p, r, m;

MATCH (p :Person)-[d :DIRECTED]->(m :Movie) WHERE p.name = "Andrew Stanton" RETURN p.name, collect(m.title);
The Data
● Data gathered from IMDb using IMDbPY
○ http://imdbpy.sourceforge.net/
○ IMDb not for commercial use, OK for this presentation
● Use IMDbPY to query database
● Either
○ Write script to build csv files for nodes and relationships
○ Directly ingest with py2neo (http://py2neo.org/v3/)
● This demo
○ 325 movies + full cast, directors, producers, writers
○ Easily extend with any further information e.g. full crew, trivia,
keywords etc
○ Initially 10 example users with ratings assigned to 25% of movies
following a defined distribution
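The “write script to build csv files” option could look roughly like the sketch below. It assumes the header convention used by Neo4j’s bulk import tool (`:ID`, `:LABEL`, `:START_ID`, `:END_ID`, `:TYPE`); the sample rows, identifiers and the `write_csv` helper are hypothetical, and the CSV text is built in memory rather than written to disk.

```python
import csv
import io

# Hypothetical sample rows; the real demo data came from IMDb via IMDbPY.
people = [("p1", "Andrew Stanton")]
movies = [("m1", "Finding Nemo")]
directed = [("p1", "m1")]


def write_csv(header, rows):
    """Write rows under a header and return the resulting CSV text."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


# One file per node label, one per relationship type.
person_csv = write_csv(["personId:ID", "name", ":LABEL"],
                       [(pid, name, "Person") for pid, name in people])
movie_csv = write_csv(["movieId:ID", "title", ":LABEL"],
                      [(mid, title, "Movie") for mid, title in movies])
directed_csv = write_csv([":START_ID", ":END_ID", ":TYPE"],
                         [(pid, mid, "DIRECTED") for pid, mid in directed])

print(person_csv, movie_csv, directed_csv, sep="\n")
```

Keeping nodes and relationships in separate files mirrors the graph model: relationship rows only reference node identifiers.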
User Query (1)
Who are the most active users of our recommendation engine?

MATCH (u :User)-[r :RATED]->(m :Movie)

RETURN u.name AS Name,
u.username AS Username,
count(r) AS Reviews,
avg(r.rating) AS `Average Score`
ORDER BY Reviews DESC
LIMIT 5;
User Query (2)
We can compare two users’ reviews.

MATCH (u1 :User)-[r1 :RATED]->(m :Movie)<-[r2 :RATED]-(u2 :User)

WHERE u1.name = "Malcolm Reynolds" AND u2.name = "Jayne Cobb"
WITH m,
r1.rating AS score_mal,
r2.rating AS score_jayne
MATCH (:User)-[r :RATED]->(m)
RETURN m.title AS Movie,
score_mal AS `Mal's Rating`,
score_jayne AS `Jayne's Rating`,
count(r.rating) AS `Total Reviews`,
avg(r.rating) AS `Average Rating`
ORDER BY (score_mal + score_jayne)/2 DESC
LIMIT 3;
Similarity
We can measure how similar users are based on their reviews.
To do this we will be using the Euclidean distance between their
ratings of the same movies. The smaller the value, the more similar
the users.
Similarity - Example

Rating 1   Rating 2   Diff   Diff Squared
1          5          -4     16
1          4.5        -3.5   12.25
1          2.5        -1.5   2.25
1          4          -3     9
1          1          0      0
Total                        39.5
Distance                     6.28
Similarity - Example

Rating 1   Rating 2   Diff   Diff Squared
5          5          0      0
4.5        4.5        0      0
2.5        2.5        0      0
4          4          0      0
1          1          0      0
Total                        0
Distance                     0.00
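The two worked examples can be reproduced with a few lines of Python. This is a minimal sketch of the Euclidean distance only; the function name is an assumption, and the ratings are the ones from the two tables.

```python
import math


def euclidean_distance(ratings_a, ratings_b):
    """Square root of the summed squared differences between paired ratings."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(ratings_a, ratings_b)))


# First example: very different ratings (sum of squared differences is 39.5).
first = euclidean_distance([1, 1, 1, 1, 1], [5, 4.5, 2.5, 4, 1])

# Second example: identical ratings, so the distance is zero.
second = euclidean_distance([5, 4.5, 2.5, 4, 1], [5, 4.5, 2.5, 4, 1])

print(round(first, 2), round(second, 2))  # 6.28 0.0
```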
Nearest Neighbours
We can calculate similarities in Neo4j and then find a user’s nearest neighbours.

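// Update similarities between users

MATCH (u1:User)-[x:RATED]->(m:Movie)<-[y:RATED]-(u2:User)
WHERE u1 <> u2
// Group by user pair only, so that the differences for all shared movies are collected together.
// The distance is divided by the number of shared movies.
WITH SQRT(REDUCE(acc = 0.0, dif IN COLLECT(x.rating - y.rating) | acc + dif^2))/count(m) AS sim, u1, u2
MERGE (u1)-[s:SIMILARITY]-(u2)
SET s.similarity = sim;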

// Get nearest neighbors (lower the better)

MATCH (u1 :User)-[s :SIMILARITY]-(u2 :User)
WHERE u1.name = "Malcolm Reynolds"
WITH u2.name AS Neighbor,
s.similarity AS sim
ORDER BY sim
RETURN Neighbor,
sim AS Similarity
LIMIT 10;
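Outside the database, the same idea can be sketched in Python: compute a distance over the movies two users have both rated, divide by the number of shared movies as the similarity query does, and sort ascending. The rating values, the movie keys and the third user are hypothetical; only the first two user names appear in the slides.

```python
import math

# Hypothetical ratings keyed by user, then by movie.
ratings = {
    "Malcolm Reynolds": {"A": 5.0, "B": 3.0, "C": 4.0},
    "Jayne Cobb": {"A": 1.0, "B": 3.0},
    "Kaylee Frye": {"A": 5.0, "C": 3.0, "D": 4.0},  # hypothetical extra user
}


def similarity(a, b):
    """Euclidean distance over shared movies, divided by their count."""
    shared = a.keys() & b.keys()
    if not shared:
        return None  # no movies in common, so no similarity score
    distance = math.sqrt(sum((a[m] - b[m]) ** 2 for m in shared))
    return distance / len(shared)


def nearest_neighbours(user, k=10):
    """Return up to k (name, score) pairs, most similar (lowest score) first."""
    scores = []
    for other, other_ratings in ratings.items():
        if other == user:
            continue
        score = similarity(ratings[user], other_ratings)
        if score is not None:
            scores.append((other, score))
    return sorted(scores, key=lambda pair: pair[1])[:k]


print(nearest_neighbours("Malcolm Reynolds"))
```

Storing the score on a SIMILARITY relationship, as the Cypher does, avoids recomputing it on every lookup.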
Recommendations
We can now design our recommendation engine.

1. User logs in and is registered in the database
2. User rates a new movie
3. System
a. Updates database information
b. Finds the user’s new nearest neighbours
c. Calculates the average rating of each movie across the x most similar users
d. Excludes movies already rated by the user
e. Orders movies by average rating
f. Delivers recommendations
4. User filters results, for example by genre

Let’s see how that looks...

Recommendation Query
We can now design our recommendation engine.

// Match user a to users who have rated 2 or more of the same films
MATCH (u2 :User)-[:RATED]->(film: Movie)<-[:RATED]-(u1 :User {id: "usr0"})
WITH u1,
     u2,
     COUNT(film) AS film_count
// Reviewed two or more of the same films
WHERE film_count > 1
WITH u1,
     u2
// MATCH Users similarities
MATCH (u2)-[s:SIMILARITY]-(u1)
WITH u1, u2, s.similarity AS similarity
ORDER BY similarity
LIMIT 3
WITH u1, u2, similarity
// Get movies rated by the similar users
MATCH (u2)-[r:RATED]->(m :Movie)
// Only movies user 1 hasn't seen
WHERE NOT((u1)-[:RATED]->(m))
WITH m,
     similarity,
     r.rating AS rating
// Group movies
ORDER BY m.title
WITH m,
     REDUCE(s = 0, i IN COLLECT(rating) | s + i) * 1.0 AS rating_sum,
     SIZE(COLLECT(rating)) AS n_ratings
// Movie must have at least 2 ratings
WHERE n_ratings > 1
// Get the average review
WITH m,
     rating_sum/n_ratings AS reco,
     n_ratings
// Get the genres
MATCH (m)-[:HAS_GENRE]->(g :Genre)
WITH m,
     reco,
     COLLECT(g.name) AS genres,
     n_ratings
// Order by the average recommendation score (not average rating)
// and then n_ratings
ORDER BY reco DESC,
         n_ratings DESC
// Return list
RETURN m.title,
       reco AS score,
       genres
LIMIT 10
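To make the flow of the recommendation query easier to follow, here is a minimal in-memory sketch of the same steps in Python. It is an approximation under stated assumptions, not the deck’s implementation: the ratings are hypothetical, similarity is recomputed on the fly rather than read from SIMILARITY relationships, and genres are left out.

```python
import math
from collections import defaultdict

# Hypothetical ratings keyed by user id, then by movie.
ratings = {
    "usr0": {"A": 5.0, "B": 4.0},
    "usr1": {"A": 5.0, "B": 4.0, "C": 4.0, "D": 2.0},
    "usr2": {"A": 4.0, "B": 4.0, "C": 5.0, "D": 3.0, "E": 5.0},
    "usr3": {"A": 1.0, "C": 1.0},  # shares only one film with usr0
}


def similarity(a, b):
    """Euclidean distance over shared movies, divided by their count."""
    shared = a.keys() & b.keys()
    return math.sqrt(sum((a[m] - b[m]) ** 2 for m in shared)) / len(shared)


def recommend(user, k=3, min_shared=2, min_ratings=2, limit=10):
    mine = ratings[user]
    # Users who have rated at least min_shared of the same films.
    candidates = [
        (similarity(mine, theirs), other)
        for other, theirs in ratings.items()
        if other != user and len(mine.keys() & theirs.keys()) >= min_shared
    ]
    # Keep the k most similar users (lowest distance first).
    neighbours = [other for _, other in sorted(candidates)[:k]]
    # Collect their ratings for movies the user has not rated.
    collected = defaultdict(list)
    for other in neighbours:
        for movie, rating in ratings[other].items():
            if movie not in mine:
                collected[movie].append(rating)
    # Require min_ratings per movie, then average.
    scored = [
        (movie, sum(values) / len(values), len(values))
        for movie, values in collected.items()
        if len(values) >= min_ratings
    ]
    # Order by score, then by number of ratings, both descending.
    return sorted(scored, key=lambda row: (-row[1], -row[2]))[:limit]


print(recommend("usr0"))
```

Here `usr3` is dropped for sharing only one film, and movie `E` is dropped because only one neighbour rated it.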
Extending the Recommender
The current setup demonstrates the power of a graph database
for recommendation. It could be extended in several ways:

● More complex queries incorporating further information, e.g. favourite actors
○ Natural use of Neo4j
○ Easily scales
○ Simple to understand
○ Real time
● Scale with more complex Machine Learning
○ Expand on simple nearest neighbour example
○ Include additional contextual information
Google Cloud Platform
We are using Google Cloud Platform (GCP) to help us build and deploy services/applications.

● Google are working towards a ‘no-ops’ environment
● Incorporate virtual machines, clusters, software and APIs
● Allows for rapid prototyping and focus on areas of expertise
Future Stack
Incorporating Machine Learning into the application is easy with GCP.

[Diagram; surviving labels: Ingest Data, Process Data, Learn/Optimise, Store, Batch/Stream, Front End, Store]
Moving to an Application
We have built a simple recommendation engine around data
stored in Neo4j.

● How is this delivered to the user?

● How does the front end communicate with Neo4j?
Front End
Introduction
● Working using a build, measure, learn methodology allows
us to work through prototypes quickly
● Front end choices motivated by ability to reuse
components
● Walk through some design choices for Movie Recommendation
Engine
● Discuss future changes and what we have learned
Movie Recommendation Engine Technology Stack
[Diagram of the technology stack; surviving labels: Axios, User]
Interacting with Neo4j: Key Points
● Interacting with Neo4j through its transaction endpoints
○ Move to using a Node server connected via the Bolt driver
● Able to send multiple queries with one request
● Query design: balance query speed/complexity against the number
of requests and the amount of front-end processing
○ For example, one query to return movies with their associated genres rather
than one query per genre
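As a hedged illustration of sending several queries in one request, the sketch below builds the JSON body that Neo4j’s HTTP transaction endpoint expects: a `statements` list whose entries each carry a `statement` and its `parameters`. The two Cypher statements and the `build_payload` helper are assumptions for this example. Nothing is sent over the network, and the endpoint path depends on the Neo4j version (for example `/db/data/transaction/commit` in the 3.x line).

```python
import json


def build_payload(statements):
    """Wrap (cypher, parameters) pairs in the transaction endpoint body."""
    return {
        "statements": [
            {"statement": cypher, "parameters": parameters}
            for cypher, parameters in statements
        ]
    }


payload = build_payload([
    # One query returns every movie with all of its genres,
    # instead of issuing one query per genre.
    ("MATCH (m:Movie)-[:HAS_GENRE]->(g:Genre) "
     "RETURN m.title AS title, collect(g.name) AS genres", {}),
    # Hypothetical second statement: the current user's ratings.
    ("MATCH (u:User {id: $id})-[r:RATED]->(m:Movie) "
     "RETURN m.title AS title, r.rating AS rating", {"id": "usr0"}),
])

body = json.dumps(payload)  # would be POSTed as application/json
print(body)
```

Passing values as parameters rather than building them into the query string keeps the statements reusable and avoids quoting problems.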
State Management (1)
● Redux provides a system for state management
○ Immutable
● Data objects stored in a state tree which is updated via
Redux’s actions and reducers
● Demonstrate this for part of our application over the
next few slides
○ Focus on a small part of state tree
State Management (2)
● Application
initialises
○ Several data
objects
○ Focus on movies for
this example
State Management (3)
● Application
initialises
● Fetch movie data
from Neo4j
State Management (4)
● Application
initialises
● Fetch movie data
from Neo4j
● Movie data
processed, broken
down into genres
○ Data now available
in state
State Management (5)
● Application
initialises
● Fetch movie data
from Neo4j
● Movie data
processed, broken
down into genres
○ Data available in
state
● User selects movie
State Management (6)
● Further actions
update the state
○ Movie review
○ Select another
movie
○ Log out
○ Log in
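The state flow walked through on these slides can be sketched as a reducer: a pure function that takes the current state and an action and returns a new state without modifying the old one. The front end itself uses Redux in JavaScript; the Python below is only a stand-in to show the idea, and the action names, state shape and sample movie are hypothetical.

```python
# Hypothetical initial slice of the state tree for movies.
INITIAL_STATE = {"loading": False, "by_genre": {}, "selected": None}


def movies_reducer(state, action):
    """Return a new state for the given action; never mutate the old one."""
    kind = action["type"]
    if kind == "FETCH_MOVIES_REQUEST":
        return {**state, "loading": True}
    if kind == "FETCH_MOVIES_SUCCESS":
        # Break the fetched movies down into genres.
        by_genre = {}
        for movie in action["movies"]:
            for genre in movie["genres"]:
                by_genre.setdefault(genre, []).append(movie["title"])
        return {**state, "loading": False, "by_genre": by_genre}
    if kind == "SELECT_MOVIE":
        return {**state, "selected": action["title"]}
    return state  # unknown actions leave the state unchanged


actions = [
    {"type": "FETCH_MOVIES_REQUEST"},
    {"type": "FETCH_MOVIES_SUCCESS",
     "movies": [{"title": "Finding Nemo", "genres": ["Animation", "Adventure"]}]},
    {"type": "SELECT_MOVIE", "title": "Finding Nemo"},
]

state = INITIAL_STATE
for action in actions:
    state = movies_reducer(state, action)

print(state)
```

Because every step returns a fresh dictionary, earlier states remain available, which is what makes the state tree easy to inspect and debug.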
Future Stack
Joining the back and front end together helps us build out our application.

[Diagram; surviving labels: Ingest Data, Process Data, Learn/Optimise, Store, Batch/Stream, Store, User]
Extending the User Experience
● Currently have a very basic application
● Neo4j + GCP give us the ability to quickly iterate through new ideas using
a build, measure, learn methodology
○ Add new labels, nodes, relationships and properties to the graph
○ Ingest new data and add more ML
● User could add review comments
○ Sentiment analysis
○ Improve recommendation
● Users could add each other as friends
○ Improve recommendations
○ Push alerts
● Conversational UI/chatbot
● Live voting system
Live Demo
Thanks for listening.
Any questions?
