Ballerina Capuchina: How to Write Complex Questions
What do we consider a good question?
A good question isn’t necessarily one that's difficult for a human developer—it’s one that
exposes real challenges for current language models. The goal isn't to maximize complexity, but
to target areas where even the best LLMs tend to struggle.
Complex questions in coding environments:
Think about common LLM weaknesses when working with production code—like losing context
across files, failing to link related components in different modules, or misunderstanding
config-to-r
💥 Hard Question Types for LLMs on GitHub Repos
Introduction
Below are real-world question types that LLMs often fail to solve well—and why.
1. Cross-File “Glue” Questions
Prompt Example:
“Can you trace how a request handled in api/order.js ultimately triggers the
email-sending logic in mailer/sendInvoice.ts, and explain how data flows between
these modules—including any intermediate services, function calls, or shared utilities
involved in the process?”
Why it fails: The model must reason across multiple files and track a call chain.
With limited context, it may miss or misinterpret the link between modules.
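To make the "glue" concrete, here is a minimal Python sketch of the kind of cross-module call chain such a question targets. All names (handle_order_request, process_order, send_invoice) are hypothetical and condensed into one file; in a real repository each function would live in a separate module, which is exactly what makes the tracing hard for a model.

```python
# Hypothetical cross-module call chain, condensed into one file.
# In a real repo these would live in api/, services/, and mailer/ respectively.

def send_invoice(order):
    # Terminal step (think mailer/sendInvoice): format and "send" the invoice.
    return f"invoice emailed for order {order['id']}"

def process_order(order):
    # Intermediate service: validates the order, then hands off to the mailer.
    if not order.get("items"):
        raise ValueError("empty order")
    return send_invoice(order)

def handle_order_request(payload):
    # Entry point (think api/order): the handler a glue question starts from.
    order = {"id": payload["order_id"], "items": payload.get("items", [])}
    return process_order(order)

print(handle_order_request({"order_id": 7, "items": ["book"]}))
# → invoice emailed for order 7
```

A good glue question forces the model to reconstruct this entire handler → service → mailer chain from files it must find and connect itself.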
2. Cyclic Imports or Dependency Graph Fixes
Prompt Example:
“How would you refactor the Go packages to break the cyclic dependency between
model/user.go and state/session.go, particularly around the NewUser() and
InitSession() functions, and what structural changes would preserve their behavior
while decoupling their imports?”
Why it fails: Requires global insight into package structure. Many LLMs fail to
suggest viable restructuring (e.g., introducing a new shared module), even though that’s
a known pattern among experienced devs.
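The "introduce a new shared module" refactor mentioned above can be sketched in Python, where circular imports cause the same kind of breakage as Go's cyclic package dependencies. The names (UserRef, new_user, init_session) are hypothetical; in a real codebase each section below would be its own module, with both former cycle participants importing only the new shared one.

```python
# Breaking a cycle: instead of model/user importing state/session and vice
# versa, both sides depend on a new neutral module that owns the shared type.
import itertools
from dataclasses import dataclass

# --- shared/types: the new module both sides can safely import ---
@dataclass
class UserRef:
    user_id: int
    name: str

# --- model/user: creates users; no longer imports state/session ---
_ids = itertools.count(1)

def new_user(name):
    return UserRef(user_id=next(_ids), name=name)

# --- state/session: depends only on shared/types, not on model/user ---
def init_session(user: UserRef):
    return {"session_for": user.user_id, "user": user.name}

u = new_user("ada")
print(init_session(u))
# → {'session_for': 1, 'user': 'ada'}
```

The behavior of both sides is preserved; only the import direction changes, which is the structural insight a strong answer needs to articulate.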
3. Dynamic Analyses (AST, PropTypes, Runtime Checks)
Prompt Example:
“What could cause the React prop-types checker to flag only during CI runs, and how
might differences in environment, build configuration (e.g., .babelrc,
webpack.config.js), or module resolution impact the behavior of prop-types validation
in lib/rules/prop-types.js when analyzing components/Card.jsx during
production versus local development?”
Why it fails: Requires deep knowledge of AST traversal + runtime execution + test
orchestration. LLMs struggle to combine these layers without concrete traces.
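The "fails only in CI" pattern can be illustrated with a small Python sketch. This is not the real prop-types checker; it is a hypothetical stand-in (validate_props, render_card, a BUILD_ENV variable) showing how an environment-gated check can stay silent locally yet fail in a production build, which is the layered behavior such questions probe.

```python
import os

# Hypothetical sketch: a strict validation step gated on the build
# environment, mimicking a check that fires in CI but not locally.

def validate_props(props, required=("title",)):
    missing = [k for k in required if k not in props]
    if missing:
        raise TypeError(f"missing required props: {missing}")
    return True

def render_card(props, env=None):
    # CI would set BUILD_ENV=production; local dev leaves it unset.
    env = env or os.environ.get("BUILD_ENV", "development")
    if env == "production":
        validate_props(props)  # strict path only taken in CI-like builds
    return f"<Card {props}>"

render_card({}, env="development")       # silent, like a local run
try:
    render_card({}, env="production")    # raises, like the CI failure
except TypeError as e:
    print("CI-only failure:", e)
```

Answering the real question well means connecting this kind of environment gating to build configuration and module resolution, not just reading one file.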
Step-by-step guide: how to discover a complex question.
This section offers a friendly, practical guide to help you
begin understanding the context and structure of the
repository you're working with.
🧐 1. Understand the Lay of the Land: Build a Mental Model
👀 2. Dig Deeper: Map Cross-File Relationships
🧠 3. Brainstorm and Formulate Your Question
1. Understand the Lay of the Land: Build a Mental Model
Before you can ask a good question, you need to know what you're looking at.
● Get a Quick Overview: Use Cursor to get a high-level summary of the repository.
What are the main directories? What are the key components, and how do they
interact?
● Scan the Directory Tree: Identify the major areas of the codebase, such as the API,
UI, data handling, tests, and build processes.
● Identify Key Structures: Look for public APIs, class hierarchies, shared utilities,
and points where dependencies are injected.
● Trace a Workflow: Follow at least one complete data or control flow. For example,
trace a user request from the handler to the service that interacts with the
database.
● Review Recent Changes: Look at your given PR. These often reveal hidden
connections and dependencies between different parts of the code. This can help
you determine which collection of files would work best for your question.
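The "trace a workflow" step above can be sketched as a minimal Python example. The names (handle_signup, create_user, db_insert) are hypothetical and the three layers are condensed into one file; in a real repository you would follow this same handler → service → data-access path across separate modules.

```python
# Hypothetical request flow: handler -> service -> database layer,
# condensed into one file for illustration.

DB = {}  # in-memory stand-in for the database layer

def db_insert(table, row):
    # Data-access layer: persists a row and returns its id.
    DB.setdefault(table, []).append(row)
    return len(DB[table])

def create_user(name):
    # Service layer: enforces business rules before touching the DB.
    if not name:
        raise ValueError("name required")
    row_id = db_insert("users", {"name": name})
    return {"id": row_id, "name": name}

def handle_signup(request):
    # Handler: parses the incoming payload and calls the service.
    return create_user(request.get("name", ""))

print(handle_signup({"name": "ada"}))
# → {'id': 1, 'name': 'ada'}
```

Tracing even one such flow end to end gives you the vocabulary of files and functions a strong cross-file question needs.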
2. Dig Deeper: Map Cross-File Relationships
Understand how different parts of the codebase connect with each other. For example,
look for:
● Function Calls: Where does a function in one file call a function in another?
● Shared Information: Is there shared state or configuration, like environment
variables or global settings?
● Interfaces and Implementations: Where are interfaces defined, and where are they
implemented?
● Data Models: Find where data models are defined and then see how they are
used in other places like migrations, serializers, or tests.
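The "interfaces and implementations" relationship above can be sketched in Python with the abc module. The names (Serializer, JsonSerializer, CsvSerializer) are hypothetical; the point is that the interface is often defined in one file while its implementations live elsewhere, and a question spanning that split is naturally cross-file.

```python
import json
from abc import ABC, abstractmethod

# Hypothetical interface, e.g. defined in core/serializer.
class Serializer(ABC):
    @abstractmethod
    def dump(self, obj) -> str: ...

# Hypothetical implementations, e.g. living in formats/json_out and formats/csv_out.
class JsonSerializer(Serializer):
    def dump(self, obj) -> str:
        return json.dumps(obj)

class CsvSerializer(Serializer):
    def dump(self, obj) -> str:
        return ",".join(str(v) for v in obj.values())

for s in (JsonSerializer(), CsvSerializer()):
    print(type(s).__name__, s.dump({"a": 1}))
```

Mapping which files implement which interfaces is exactly the kind of relationship a model with limited context tends to miss.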
3. Brainstorm and Formulate Your Question
Now that you have a good understanding of the code, you can start thinking about what
to ask.
● Think Like a Hacker or Tester:
○ What happens if a function returns an unexpected value?
○ What if two different commits change the same configuration in conflicting
ways?
○ Could an old, outdated dependency cause a problem with a new feature?
○ Come up with three to five realistic scenarios that involve multiple files.
● Craft a Strong Question: A good question will:
○ Set the Scene: "Suppose the UserService.create() function returns None..."
○ Point to the Right Places / Reference Multiple Files: "...explain how that
affects order_processor.py and notification_mailer.go."
○ Be Answerable: Ensure the question can be answered using only the
provided code and its history.
● Discuss the interaction between functions across multiple files: It will be easier
to stump the model if your question references an interaction between a class,
function, or data structure that spans different files.
Example approach to coming up with good questions
Example Foundational Questions to Get Started
If you're just starting to explore a repository, here are some good initial questions to ask:
● What is the main purpose of this repository?
● What is its primary function?
● What are the most important directories?
● Can you give me a detailed explanation of how the top three key functionalities in
lib/util are used, with examples?
🚨🚨🚨 Once you have come up with the question you will use in your
task, remember to always start a new chat before asking it.
Assess your question before starting
GENERAL INSTRUCTIONS
📌 Focus on different criteria related to the repository and the code functionalities in the
source code files. Check the Question Styles and Diversity section for the different types
of questions you may ask in different tasks.
📌 Try to ask hard questions that will stump the model, that is, questions the model
cannot fully answer.
📌 Harder prompts rely on data from multiple parts of the codebase and require the
agent to synthesize knowledge from multiple files, especially how those files
interact. The questions should be about the code in the repository. For example, if you’re
working on the Pandas repository, you should ask about the implementation of Pandas
as if you were a developer working on the Pandas implementation, not something like
“how do I create a DataFrame in Pandas”.
📌 The questions need to be realistic (NO FANCY FORMATTING such as markdown).
● ❌ Avoid backticks and markdown, such as ###, in the questions.
● ❌ Don’t just copy the question examples as templates. Be creative.
● ❌ Don’t be formal. Use casual language.
● ❌ Avoid writing questions that look like GitHub issue descriptions: Think about
what a developer may ask a coding agent when using a repository.
If your task has an issue description with replication code, you can use that as
INSPIRATION for the files and modules you may want to mention in your question.
● Be precise: Try to match the level of precision you’d normally provide when
prompting an LLM, but do not leak the PR solution if you are using the PR as
inspiration!
● For debugging-type questions, describe or mention the relevant existing code or
functionality if needed: The questions should include the relevant information for
the Agent to address the problem.
● Reference relevant files: Reference other files using the @file_name convention
in Cursor, which provides that file as context.
Quick Checklist for a Good Question
Use this checklist to make sure your question is solid:
● ✅ Does my question involve an interaction between at least two or three
different files?
● ✅ Does answering the question require combining information from those
different files?
● ✅ Is the scenario I'm presenting realistic and based on how the repository
actually works?
● ✅ Is it completely clear what the person answering the question needs to
provide?