E5A6 good-turing-smoothing · GitHub Topics · GitHub
[go: up one dir, main page]

Skip to content
#

good-turing-smoothing

Here are 20 public repositories matching this topic...

A coursework-style project from my MSc Machine Learning on Big Data (University of East London), using PySpark to compute word frequency distributions on a large English corpus (~9.5 million words) and to compare frequency estimates from small samples against the full dataset.

  • Updated Nov 19, 2025
  • Python

Improve this page

Add a description, image, and links to the good-turing-smoothing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the good-turing-smoothing topic, visit your repo's landing page and select "manage topics."

Learn more

0