GSoC/GCI Archive
Google Summer of Code 2013 Python Software Foundation

scikit-learn: Biclustering algorithms, scoring, and data generation

by Kemal Eren for Python Software Foundation

Biclustering is an unsupervised learning method that simultaneously clusters samples and features. It has become a popular technique in various fields including bioinformatics and text mining. I propose to add biclustering capabilities to scikit-learn by implementing popular algorithms, scoring metrics, and data generation functions.