会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 8. 发明申请
    • METHOD AND SYSTEM FOR DATA EXTRACTION FROM IMAGES OF SEMI-STRUCTURED DOCUMENTS
    • 用于数据提取的方法和系统从半结构化文档的图像中提取
    • US20170068866A1
    • 2017-03-09
    • US14868683
    • 2015-09-29
    • ABBYY Development LLC
    • Mikhail Kostyukov
    • G06K9/18G06K9/46
    • G06K9/18G06K9/00449G06K9/00456G06K9/4604G06K2209/01
    • The present invention is directed to a method of extracting data from fields in an image of a document. In one implementation, a text representation of the image of the document is obtained. A graph for storing features of the text fragments in the text representation of the image of the document and their links is constructed. A cascade classification for computing the features of the text fragments in the text representation of the image of the document and their link is run. Hypotheses about the belonging of text fragments to the fields in the image of the document are generated. Combinations of the hypotheses are generated. A combination of the hypotheses is selected. And data from the fields in the image of the document is extracted based on the selected combination of the hypotheses.
    • 本发明涉及一种从文档图像中的字段提取数据的方法。 在一个实现中,获得文档的图像的文本表示。 用于存储文本图像的文本表示中的文本片段的特征及其链接的图形被构造。 运行用于计算文档图像的文本表示中的文本片段的特征及其链接的级联分类。 生成关于文档图像中的文本片段归属的假设。 产生假设的组合。 选择假设的组合。 并且基于所选择的假设的组合来提取文档图像中的字段的数据。