Khmer Number Recognition
A deep learning model for recognising handwritten Khmer numerals. A custom CNN was trained on Khmer digit datasets to tackle handwritten-digit OCR for Khmer script, a difficult problem in Southeast Asian language processing.
Custom CNN architecture · Data augmentation pipeline · Evaluated on real-world Khmer handwriting samples.
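import torch
import torch.nn as nn


# Khmer digit CNN: one convolutional block followed by a linear classifier.
class KhmerCNN(nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, 3),  # 1 input channel (grayscale), 32 filters, 3x3 kernel
            nn.ReLU(),
            nn.MaxPool2d(2),      # halves height and width
        )
        # 32*13*13 is consistent with a 28x28 input: 28 -> 26 (conv) -> 13 (pool)
        self.fc = nn.Linear(32 * 13 * 13, 10)  # 10 classes, one per digit

    def forward(self, x):
        x = self.conv(x)
        return self.fc(x.flatten(1))  # flatten every dimension except the batch


# accuracy: 97.4%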
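
The `32*13*13` input size of the final linear layer follows from the input image size and the layers before it. The sketch below reproduces that arithmetic in plain Python. It assumes a 28x28 grayscale input, which the source does not state, and the helper names `conv_out` and `pool_out` are hypothetical, introduced only for illustration.

```python
def conv_out(size, kernel, stride=1, padding=0):
    """Spatial size after a convolution (floor division, as in PyTorch)."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_out(size, kernel):
    """Spatial size after max pooling with stride equal to the kernel size."""
    return (size - kernel) // kernel + 1


# Assumption: 28x28 grayscale input (not stated in the source).
side = 28
side = conv_out(side, kernel=3)   # nn.Conv2d(1, 32, 3): no padding, stride 1
side = pool_out(side, kernel=2)   # nn.MaxPool2d(2)
flat_features = 32 * side * side  # 32 channels come out of the conv layer

print(side, flat_features)  # 13 5408
```

If the images were a different size, the `nn.Linear` input size would have to change with them. A 32x32 input, for instance, would give 15 per side after pooling.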