Constructing Chinese question classification dataset with active learning
CSTR:
Author:
Affiliation:

Clc Number:

TP391

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    The current corpora of question classification are relatively small and difficult to meet the practical needs of Question Answering system, so that we use active learning methods to construct a Chinese question classification dataset and for question labeling. In addition, we improve the performance of labeling with fea- ture selection. Experimental results show that by using active learning we can quickly converge at the best ac- curacy (85% ) and by using manual tagging we can have small feature set size. The active learning-based la- beling method achieved very good classification performance with less manual annotation tagging, which can significantly improve the accuracy of classification to some degree

    Reference
    Related
    Cited by
Get Citation
Related Videos

Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:
  • Revised:
  • Adopted:
  • Online: May 31,2012
  • Published:
Article QR Code