Copyright (c) 2006 by
Xuan-Hieu Phan (pxhieu at gmail dot com), Graduate School of Information Sciences, Tohoku University
CRFChunker: A Java-based Conditional Random Fields Phrase Chunker (Phrase Chunking Tool) for English, built upon FlexCRFs. The model was trained on sections 01..24 of the WSJ corpus, with section 00 used as the development test set (F1-score of 95.77). Chunking speed: 700 sentences / second.
Researchers using this tool to run experiments should include the following citation:
Xuan-Hieu Phan, "CRFChunker: CRF English Phrase Chunker", http://crfchunker.sourceforge.net/, 2006.
We would like to thank Professor Tu-Bao Ho for providing us with the Penn Treebank data for training the POS tagging model. We would also like to thank Sourceforge.net for hosting this project.
Last updated: March 24, 2007