会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Identifying training documents for a content classifier
    • 识别内容分类器的培训文档
    • US08352386B2
    • 2013-01-08
    • US12497467
    • 2009-07-02
    • Srinivas Varma ChitiveliBarton Wayne EmanuelAlexander Wolcott HoltMichael E. Moran
    • Srinivas Varma ChitiveliBarton Wayne EmanuelAlexander Wolcott HoltMichael E. Moran
    • G06F15/18
    • G06N99/005G06F17/30707
    • Systems, methods and articles of manufacture are disclosed for identifying a training document for a content classifier. One or more thresholds may be defined for designating a document as a training document for a content classifier. A plurality of documents may be evaluated to compute a score for each respective document. The score may represent suitability of a document for training the content classifier with respect to a category. The score may be computed based on content of the plurality of documents, metadata of the plurality of documents, link structure of the plurality of documents, user feedback (e.g., user supplied document tags) received for the plurality of documents, and document metrics received for the plurality of documents. Based on the computed scores, a training document may be selected. The content classifier may be trained using the selected training document.
    • 公开了用于识别内容分类器的训练文档的系统,方法和制品。 可以定义一个或多个阈值来指定文档作为内容分类器的训练文档。 可以评估多个文档以计算每个相应文档的得分。 该分数可以表示用于针对类别来训练内容分类器的文档的适合性。 可以基于多个文档的内容,多个文档的元数据,多个文档的链接结构,为多个文档接收的用户反馈(例如,用户提供的文档标签)以及接收到的文档度量来计算分数 用于多个文档。 基于计算出的分数,可以选择训练文档。 内容分类器可以使用所选择的训练文档进行训练。