Introduction to Large Language Models
Assignment- 11
Number of questions: 10 Total marks: 10 × 1 = 10
_________________________________________________________________________
QUESTION 1: [1 mark]
What is the main modification that SimplE makes to DistMult-like models to handle
asymmetric relations?
a. Replacing entity embeddings with random fixed vectors
b. Introducing separate entity embeddings for subject and object roles, along with
inverse relations
c. Restricting the rank of the relation tensor to 1
d. Using negative sampling for half of the triple set
Correct Answer: b
Explanation:
• SimplE extends the DistMult approach by using two different embeddings for each
entity: one as a subject (head) and one as an object (tail). It also learns inverse
relation embeddings to handle asymmetry.
• DistMult alone struggles with asymmetric relations because its scoring function
score(s, r, o) is symmetric in s and o. SimplE partially addresses this by modelling
subject/object roles distinctly and considering inverse relations.
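The role split can be made concrete in a few lines. Below is a minimal sketch of the SimplE scoring idea with tiny hand-picked vectors; the embedding values and the `simple_score` helper are illustrative only, not the paper's training setup:

```python
import numpy as np

def simple_score(head_s, r, tail_o, head_o, r_inv, tail_s):
    # SimplE averages two DistMult-style triple products: the triple read with
    # the subject's head-role and object's tail-role embeddings, plus the
    # inverse relation applied to the swapped roles.
    return 0.5 * (np.sum(head_s * r * tail_o) + np.sum(head_o * r_inv * tail_s))

# each entity gets TWO embeddings: one for the head (subject) role,
# one for the tail (object) role
head = {"camel": np.array([1.0, 2.0]), "mammal": np.array([0.5, -1.0])}
tail = {"camel": np.array([2.0, 0.0]), "mammal": np.array([1.0, 3.0])}
r, r_inv = np.array([1.0, 1.0]), np.array([0.5, 2.0])

fwd = simple_score(head["camel"], r, tail["mammal"],
                   head["mammal"], r_inv, tail["camel"])
rev = simple_score(head["mammal"], r, tail["camel"],
                   head["camel"], r_inv, tail["mammal"])
# generic embeddings give different scores for (s, r, o) and (o, r, s),
# which is exactly the asymmetry DistMult alone cannot express
```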
_______________________________________________________________________
QUESTION 2: [1 mark]
Which statements correctly characterize the basic DistMult approach for knowledge graph
completion?
a. Each relation 𝑟 is parameterized by a full D×D matrix that can capture asymmetric
relations.
b. The relation embedding is a diagonal matrix, leading to a multiplicative interaction of
entity embeddings.
c. DistMult struggles with non-symmetric relations because score(s, r, o) = a_s^T M_r a_o
is inherently symmetric in s and o.
d. DistMult’s performance is typically tested only on fully symmetric KGs.
Correct Answer: b, c
Explanation:
• DistMult stands for “Diagonal bilinear model”, meaning each relation r has a
diagonal embedding M_r = diag(r). This is effectively a vector multiplied element-wise
(since a diagonal matrix just scales each coordinate).
• This diagonal constraint leads to symmetric scoring: swapping s and o yields the
same value because of the multiplication pattern.
• Therefore:
o (b) is true: the relation embedding is diagonal.
o (c) is true: it struggles with asymmetric relations because a_s^T diag(r) a_o is
symmetric in a_s and a_o.
• (a) is false because DistMult does not use a full D×D matrix; it specifically uses a
diagonal matrix.
• (d) is also false: DistMult can be tested on general KGs (not only fully symmetric
ones), though it does poorly with strongly asymmetric relations.
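The symmetry in (c) is easy to verify numerically. A minimal sketch with random embeddings (the vectors are arbitrary; any choice exhibits the same symmetry):

```python
import numpy as np

def distmult_score(a_s, r, a_o):
    # a_s^T diag(r) a_o reduces to an element-wise triple product
    return float(np.sum(a_s * r * a_o))

rng = np.random.default_rng(1)
a_s, r, a_o = rng.normal(size=5), rng.normal(size=5), rng.normal(size=5)
# swapping subject and object never changes the score:
# each term a_s[i] * r[i] * a_o[i] is invariant under the swap
```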
_________________________________________________________________________
QUESTION 3: [1 mark]
Which statements about the ComplEx extension of DistMult are true?
a. It uses complex-valued embeddings to better capture asymmetric or anti-symmetric
relations.
b. It replaces the multiplication in DistMult with element-wise addition of real-valued
vectors.
c. For a perfectly symmetric relation, one could set the imaginary part of the relation
embedding to zero.
d. ComplEx requires each entity vector to be unit norm in the complex plane.
Correct Answer: a, c
Explanation:
ComplEx extends DistMult to the complex domain, allowing it to handle asymmetric
relations by using conjugate-based interactions. A typical ComplEx scoring function is
Re(<a_s, r, conj(a_o)>), where a_s, r, a_o are complex vectors and conj(a_o) is the
complex conjugate of a_o.
• (a) True: The use of complex embeddings allows capturing asymmetry via imaginary
components and conjugation.
• (c) True: If the imaginary part is zero, the relation effectively behaves like DistMult in
the real domain, which is symmetric.
• (b) is false: It still uses multiplicative interactions, not element-wise addition.
• (d) is not necessarily required; while some implementations might normalize
embeddings, ComplEx does not strictly require each entity to have unit norm. It’s a
common training choice but not an inherent necessity.
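The conjugation is what breaks the symmetry. A minimal sketch of the scoring function Re(<a_s, r, conj(a_o)>) with hand-picked complex vectors (values are illustrative):

```python
import numpy as np

def complex_score(a_s, r, a_o):
    # Re(<a_s, r, conj(a_o)>): element-wise product with the conjugated object
    return float(np.real(np.sum(a_s * r * np.conj(a_o))))

a_s = np.array([1 + 1j, 2 - 1j])
a_o = np.array([0.5 - 2j, 1 + 0.5j])
r_complex = np.array([1 + 1j, 1 - 1j])   # nonzero imaginary part: asymmetric
r_real = np.array([2.0 + 0j, 3.0 + 0j])  # zero imaginary part, as in option (c)
# with r_complex, swapping subject and object changes the score;
# with r_real, the relation degenerates to symmetric DistMult behaviour
```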
_________________________________________________________________________
QUESTION 4: [1 mark]
Which best describes the main advantage of using a factorized representation (e.g.,
DistMult, ComplEx) for large KGs?
a. It enforces that every relation in the KG be perfectly symmetric.
b. It ensures each entity is stored as a one-hot vector, simplifying nearest-neighbour
queries.
c. It collapses the entire KG into a single scalar value.
d. It significantly reduces parameters and enables generalization to unseen triples by
capturing low-rank structure.
Correct Answer: d
Explanation:
• Factorized models (DistMult, ComplEx, etc.) leverage low-rank embeddings for
entities and relations, drastically reducing the parameters needed compared to
storing a full adjacency or full 3D tensor for the entire knowledge graph.
• This low-rank structure also helps the model learn patterns and generalize to
unseen triples (i.e., it can guess new edges that weren’t explicitly in training).
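The savings can be shown with back-of-the-envelope counts. The sizes below (100k entities, 1k relations, 200-dim embeddings) are toy numbers chosen for illustration:

```python
def full_tensor_params(n_entities, n_relations):
    # storing every possible (s, r, o) cell of the 3D adjacency tensor
    return n_entities * n_entities * n_relations

def factorized_params(n_entities, n_relations, dim):
    # DistMult-style: one dim-sized vector per entity and per relation
    return (n_entities + n_relations) * dim

full = full_tensor_params(100_000, 1_000)      # 10^13 cells
fact = factorized_params(100_000, 1_000, 200)  # ~2 * 10^7 parameters
```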
________________________________________________________________________
QUESTION 5: [1 mark]
Which statement best describes the reshaping of a 3D KG tensor X ∈ R^(|E|×|R|×|E|) into a
matrix factorization problem?
a. One axis remains for subject, one axis remains for object, and relations are
combined into a single expanded axis.
b. The subject dimension is repeated to match the relation dimension, resulting in a 2D
matrix.
c. Each subject–relation pair is collapsed into a single dimension, while objects remain
as separate entries.
d. The entire KG is vectorized into a 1D array and then factorized with an SVD
approach.
Correct Answer: c
Explanation:
• A 3D KG tensor X ∈ R^(|E|×|R|×|E|) typically has subject–relation–object axes.
• One common approach is to “unfold” or “reshape” the tensor along one axis to
produce a matrix. E.g., you can merge the subject and relation dimensions into a
single dimension (subject-relation pairs) and keep the object axis separate, yielding a
2D matrix factorization problem.
• This approach is used in some methods to apply standard matrix factorization to the
KG data.
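In NumPy this unfolding is a single reshape. A minimal sketch on a tiny tensor (the sizes are arbitrary; distinct entries make it easy to track where each cell lands):

```python
import numpy as np

n_e, n_r = 4, 3  # |E| entities, |R| relations
# toy tensor with distinct entries, axes ordered subject x relation x object
X = np.arange(n_e * n_r * n_e).reshape(n_e, n_r, n_e)

# merge the subject and relation axes: each row is one (subject, relation) pair
M = X.reshape(n_e * n_r, n_e)
# with C-order reshaping, the cell for triple (s, r, o)
# sits at row s * n_r + r, column o
```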
_________________________________________________________________________
QUESTION 6: [1 mark]
Which key property of hierarchical relationships (e.g. is-a, transitivity) motivates the
exploration of specialized embedding methods over standard Euclidean KG embeddings?
a. Symmetry in the relation (A, is-a, B) implying (B, is-a, A)
b. Frequent presence of cycles in hierarchical graphs
c. Transitivity in the form (camel, is-a, mammal) and (mammal, is-a, animal) ⟹
(camel, is-a, animal)
d. The high dimensionality of the entity embeddings
Correct Answer: c
Explanation:
• Hierarchical “is-a” relations typically exhibit transitivity: if A is-a B, and B is-a C,
then A is-a C. Standard Euclidean embeddings (like DistMult, ComplEx) do not
directly enforce or encourage transitivity.
• Specialized approaches (e.g., hyperbolic embeddings, order embeddings) do a better
job of naturally capturing such transitive relations.
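Transitivity is a purely relational property, so it can be checked or expanded directly on the triples. A minimal sketch computing the transitive closure of a toy is-a edge set (the `transitive_closure` helper is illustrative, not part of any embedding method):

```python
def transitive_closure(edges):
    # repeatedly add (a, c) whenever (a, b) and (b, c) are both present
    closure = set(edges)
    changed = True
    while changed:
        changed = False
        for a, b in list(closure):
            for c, d in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure

is_a = {("camel", "mammal"), ("mammal", "animal")}
closed = transitive_closure(is_a)  # gains the implied ("camel", "animal") edge
```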
_________________________________________________________________________
QUESTION 7: [1 mark]
Which of the following statements correctly describe hyperbolic (Poincaré) embeddings for
hierarchical data?
a. They map nodes onto a disk (or ball) such that large branching factors can be
represented with lower distortion than in Euclidean space.
b. Distance grows slowly near the centre and becomes infinite near the boundary,
making it naturally suited for tree-like structures.
c. They require each node to be embedded on the surface of the Poincaré disk of
radius 1.
d. They can achieve arbitrarily low distortion embeddings for trees with the same
dimension as Euclidean space.
Correct Answers: a, b
Explanation:
• Hyperbolic (Poincaré) embeddings place points in a disk (2D) or ball (higher-D)
where distances near the boundary can become very large, effectively allowing the
model to represent tree-like expansions with less distortion than Euclidean.
• Specifically:
o (a) True: Large branching structures are more compactly represented.
o (b) True: Distances blow up near the boundary, which helps model
hierarchical “depth.”
• (c) is not strictly correct: points reside within the disk, not necessarily on the
boundary (though some parameterizations might keep norms < 1).
• (d) overstates the claim of “arbitrarily low distortion in the same dimension.”
Hyperbolic embeddings capture tree metrics with far lower distortion than Euclidean
embeddings of the same dimension, but arbitrarily low distortion at a fixed
dimension is not guaranteed.
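The distance behaviour in (b) follows directly from the Poincaré metric. A minimal sketch in the 2D disk with hand-picked points: both pairs have the same Euclidean separation (0.1), but the pair near the boundary is much further apart hyperbolically.

```python
import numpy as np

def poincare_distance(u, v):
    # d(u, v) = arcosh(1 + 2 * ||u - v||^2 / ((1 - ||u||^2) * (1 - ||v||^2)))
    sq_dist = np.sum((u - v) ** 2)
    denom = (1.0 - np.sum(u ** 2)) * (1.0 - np.sum(v ** 2))
    return float(np.arccosh(1.0 + 2.0 * sq_dist / denom))

# same Euclidean gap of 0.1, placed at different radii
near_centre = poincare_distance(np.array([0.0, 0.0]), np.array([0.1, 0.0]))
near_boundary = poincare_distance(np.array([0.85, 0.0]), np.array([0.95, 0.0]))
# distances blow up towards the boundary, which is what lets deep,
# wide tree levels fit with low distortion
```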
_________________________________________________________________________
QUESTION 8: [1 mark]
Why might a partial-order-based approach (like order embeddings) be beneficial for
modelling ‘is-a’ relationships compared to purely distance-based approaches?
a. They explicitly encode the ancestor–descendant relation as a coordinate-wise
inequality or containment.
b. They can represent negative correlations (i.e., sibling vs. ancestor) more easily than
distance metrics.
c. They inherently guarantee transitive closure of the hierarchy in the learned
embedding space.
d. They do not rely on pairwise distances but use a notion of coordinate-wise ordering
or interval containment.
Correct Answer: a, d
Explanation:
• Order embeddings interpret the “is-a” relationship as a partial order, typically using
coordinate-wise constraints (for instance, x ⪯ y if x_i ≤ y_i for all i). This can capture
ancestor–descendant relations with a straightforward geometric interpretation (e.g.,
intervals, boxes, or cones).
• They do not rely solely on a distance measure; they rely on the geometry of
“containment” or inequality in each coordinate.
• Hence, (a) and (d) are correct.
• (b) is not a specific advantage of order embeddings; representing negative
correlations (e.g., sibling vs. ancestor) is a separate modelling concern from
encoding a partial order.
• (c) is not guaranteed: the training objective encourages a partial ordering, but full
transitive closure is not built in; additional constraints or training objectives are
needed.
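Following the coordinate-wise convention above (x ⪯ y if x_i ≤ y_i for all i), the containment check reduces to a soft violation penalty that is zero exactly when the order holds. A minimal sketch with toy vectors (real order-embedding training minimizes a margin loss over such penalties; the entity names are illustrative):

```python
import numpy as np

def order_violation(x, y):
    # zero exactly when x_i <= y_i in every coordinate, i.e. x precedes y
    return float(np.sum(np.maximum(0.0, x - y) ** 2))

camel = np.array([1.0, 1.0])
mammal = np.array([2.0, 3.0])
animal = np.array([4.0, 5.0])
# camel <= mammal <= animal coordinate-wise, so all forward violations vanish,
# while the reversed pair is penalized
```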
_________________________________________________________________________
QUESTION 9: [1 mark]
Which statement about box embeddings in hierarchical modelling is most accurate?
a. Each entity or type is assigned a single real-valued vector, ignoring bounding
volumes.
b. Containment I_x ⊆ I_y in all dimensions encodes x ≺ y.
c. They rely on spherical distances around a central node to measure tree depth.
d. They cannot be used to represent set intersections or partial overlap.
Correct Answer: b
Explanation:
• Box embeddings assign each concept or entity a “box” in some coordinate space,
typically described by two corner vectors l ≤ u (lower and upper corners).
• If box X is contained within box Y on every coordinate dimension (i.e., l_X ≥ l_Y and
u_X ≤ u_Y), that corresponds to an “is-a” or subset relationship.
• This approach can also represent partial overlaps (intersecting boxes) and so on, so
(d) is incorrect.
• (c) describes a more spherical or Euclidean method, not boxes.
• (a) is the opposite of the box approach (which does not ignore bounding volumes but
explicitly uses them).
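A minimal sketch of the containment and overlap tests, with boxes given as lower/upper corner vectors (the boxes and names are illustrative):

```python
import numpy as np

def box_contains(l_outer, u_outer, l_inner, u_inner):
    # inner box sits inside outer box iff, in every coordinate,
    # l_inner >= l_outer and u_inner <= u_outer
    return bool(np.all(l_inner >= l_outer) and np.all(u_inner <= u_outer))

def boxes_overlap(l_a, u_a, l_b, u_b):
    # non-empty intersection: the coordinate-wise max of lower corners
    # stays below the min of upper corners
    return bool(np.all(np.maximum(l_a, l_b) <= np.minimum(u_a, u_b)))

mammal = (np.array([0.0, 0.0]), np.array([4.0, 4.0]))
camel = (np.array([1.0, 1.0]), np.array([2.0, 2.0]))
# camel's box nests inside mammal's box, encoding (camel, is-a, mammal);
# overlapping-but-not-nested boxes encode partial overlap, ruling out (d)
```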
_________________________________________________________________________
QUESTION 10: [1 mark]
What is a key challenge with axis-aligned open-cone (order) embeddings for hierarchical KG
data?
a. They enforce that all sibling categories have identical cone apices, which causes
overlap.
b. They require symmetrical relationships for all edges.
c. They do not allow partial orders to be extended to total orders.
d. The volume (measure) of cones is the same regardless of how “broad” or “narrow”
the cone is, making sub-categories indistinguishable by volume.
Correct Answer: d
Explanation:
• In axis-aligned cone embeddings, each entity is associated with an open cone in a
coordinate space. Whether a cone sits at a “broad” or a “narrow” category, its
measure is the same, so volume cannot reflect how big or small the category is,
making it difficult to distinguish sub-categories by volume.
• This is a known limitation: the geometry does not readily support volume-based
notions of hierarchy or subset relationships beyond the angular direction.
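The limitation can be seen concretely with one common axis-aligned formulation, where each cone is a positive orthant translated to an apex (an assumption for illustration; the helper name is hypothetical). Every such cone is a congruent copy of the orthant with infinite measure, so volume is identical whether the cone encodes "animal" or "camel":

```python
import numpy as np

def in_cone(apex, point):
    # axis-aligned open cone at `apex`: the positive orthant translated to apex
    return bool(np.all(point > apex))

animal = np.array([0.0, 0.0])
mammal = np.array([1.0, 1.0])  # deeper category: apex further along every axis

# cone(mammal) nests inside cone(animal), encoding the is-a direction,
# but both cones have the same (infinite) volume
p = np.array([2.0, 2.0])  # in both cones
q = np.array([0.5, 0.5])  # only in the broader animal cone
```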
_________________________________________________________________________