8000 Discover gists · GitHub
[go: up one dir, main page]

Skip to content

Instantly share code, notes, and snippets.

Confidence-Aware Decision Benchmark (CAD-B)

Operationally-defined evaluation of uncertainty-guided adaptive behavior in LLMs


Motivation

Modern LLMs exhibit poor calibration between confidence and accuracy (Guo et al., 2017). CAD-B tests whether models generate internal uncertainty signals that: (1) prospectively predict errors, and (2) modulate decision-making adaptively. Adapts paradigms from comparative cognition (Smith et al., 2003; Hampton, 2001; Kornell et al., 2007).

👋 Hi, I'm Maruf Ahmed

Full Stack Developer | React & React Native | Python | AI & Automation
📍 Dhaka, Bangladesh
📫 ahmedmaruf474@gmail.com
🔗 LinkedIn | GitHub


🧠 About Me

@Cayke
Cayke / extensions.json
Created October 6, 2025 23:36
VS Code Python Settings - Ruff + Pylance + Pytest (2025)
{
"recommendations": [
"ms-python.python",
"charliermarsh.ruff",
"github.vscode-github-actions",
"eamodio.gitlens",
"anysphere.cursorpyright"
]
}
@love1980s
love1980s / cf_record_id.txt
Created October 6, 2025 23:35
Temporary Cloudflare record ID
0c0f8d66cd47e0cbd6e7e4c5793c8008
@putinnet
putinnet / hello.txt
Created October 6, 2025 23:34
Example Gist created via TOR
Hello from Node.js via TOR!
PS C:\TICK\XZ> python main.py
2025-10-06 20:32:38,018 - digit_analyzer - INFO - DigitAnalyzer inicializado: min_ticks=5
2025-10-06 20:32:38,019 - rules_engine - INFO - RulesEngine XZ inicializado
2025-10-06 20:32:38,019 - trade_executor - INFO - TradeExecutor inicializado: Stake=0.35, Martingale=2x2.0, StopLoss=10.0
2025-10-06 20:32:38,019 - __main__ - INFO - XZ Bot inicializado
2025-10-06 20:32:38,019 - __main__ - INFO - Modo: DRY RUN (Simulação)
2025-10-06 20:32:38,020 - __main__ - INFO - ============================================================
2025-10-06 20:32:38,020 - __main__ - INFO - INICIANDO BOT XZ
2025-10-06 20:32:38,020 - __main__ - INFO - ============================================================
2025-10-06 20:32:38,020 - __main__ - INFO - Símbolo: R_100
{
"name": "Shopify Order Simulation",
"nodes": [
{
"parameters": {
"httpMethod": "POST",
"path": "shopify-order-sim",
"options": { "responseData": "Order received successfully" }
},
"id": "Webhook_1",
@HugsLibRecordKeeper
HugsLibRecordKeeper / output_log.txt
Created October 6, 2025 23:33
Rimworld output log published using HugsLib
Log uploaded on Tuesday, October 7, 2025, 8:33:33 AM
Loaded mods:
Harmony(brrainz.harmony)[mv:2.4.2.0]: 0Harmony(2.4.1), HarmonyMod(2.4.2)
Core(Ludeon.RimWorld): (no assemblies)
Biotech(Ludeon.RimWorld.Biotech): (no assemblies)
EdB Prepare Carefully(EdB.PrepareCarefully)[mv:1.6.2]: EdBPrepareCarefully(av:1.1.1,fv:1.6.2)
EdB Prepare Carefully 日本語翻訳追加(Proxyer.JPT.PrepareCarefullyJP): (no assemblies)
Map Preview(m00nl1ght.MapPreview): LunarLoader(1.1.16), LunarFramework(1.1.16), HugsLogPublisher(2.1.8), MapPreview(1.12.22), MapPreviewMod(1.12.22)
HugsLib(UnlimitedHugs.HugsLib)[ov:12.0.1]: 0Harmony(av:2.4.1,fv:1.2.0.1), HugsLib(av:1.0.0,fv:11.0.5)
Map Preview 日本語翻訳追加(proxyer.JPT.MapPreview): (no assemblies)
@s1oane
s1oane / codeforsound-oct6.js
Created October 6, 2025 23:33
Fade in filter
synth = Synth('saw')
filter = Filter({ type:'lowpass', cutoff:0.2, resonance:8 })
// filter
synth.fx.add(filter)
filter.cutoff.fade(0, 1, 6) // start, end, duration
// notes
synth.note.seq( [0,2,4,7], 1/2 )
0