CRFChunker: CRF English Phrase Chunker

(built upon FlexCRFs)


Copyright (c) 2006 by

Xuan-Hieu Phan (pxhieu at gmail dot com), Graduate School of Information Sciences, Tohoku University

CRFChunker: A Java-based Conditional Random Fields phrase chunker (phrase chunking tool) for English, built upon FlexCRFs. The model was trained on sections 01..24 of the WSJ corpus, with section 00 used as the development test set (F1-score of 95.77). Chunking speed: 700 sentences/second.


Researchers who use this tool in their experiments should include the following citation:

Xuan-Hieu Phan, "CRFChunker: CRF English Phrase Chunker", 2006.

We would like to thank Professor Tu-Bao Ho for providing us with the Penn Treebank data for training the POS tagging model. We would also like to thank for hosting this project.

Last updated: March 24, 2007