会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • AUTOMATED UNIT FINDING FOR NUMERIC INFORMATION RETRIEVAL
    • 自动化单元查找数字信息检索
    • US20110246458A1
    • 2011-10-06
    • US12754249
    • 2010-04-05
    • Ari TuchmanJohn Stockton
    • Ari TuchmanJohn Stockton
    • G06F17/30
    • G06F17/30864
    • The present invention is related to the task of retrieving numeric information in response to a textual keyword-based query by automatically associating a unit to the type of data being retrieved. An information retrieval system is presented which suggests a unit for data exploration by leveraging the local environment of numeric data across the corpus. This local environment is parsed, including through natural language processing and proximity-based techniques, to determine units relevant to particular keyword phrases. The system also relies on knowledge of semantically and scientifically related units to optimize their binning for suggested unit scoring.
    • 本发明涉及通过将单元自动关联到正被检索的数据的类型来响应于基于文本关键词的查询来检索数字信息的任务。 提出了一种信息检索系统,其通过利用语料库中的数字数据的局部环境来建议用于数据挖掘的单元。 这种本地环境被解析,包括通过自然语言处理和基于邻近的技术来确定与特定关键词短语相关的单元。 系统还依赖于语义和科学相关单位的知识,以优化其建议单位得分的分档。
    • 6. 发明授权
    • System and methods for units-based numeric information retrieval
    • 基于单位数字信息检索的系统和方法
    • US08756229B2
    • 2014-06-17
    • US12496199
    • 2009-07-01
    • John K. StocktonAri K. Tuchman
    • John K. StocktonAri K. Tuchman
    • G06F17/30
    • G06F17/30619G06F17/30011G06F17/30616G06F17/30675G06F17/3069G06F17/30696G06F2216/11
    • An information retrieval and analysis system for numeric data which provides high precision and recall for numeric search and uses a methodology for determining contextualization of the extracted data. The capabilities include extracting, parsing, and contextualizing numeric data including both a numeric value and an accompanying unit. This system facilitates the organization of largely unstructured numeric data into an inverted index and other database formats. An information retrieval system which enables the exploration and refinement of an extracted numeric data set defined by a search input that may be precise or initially vague. This system also facilitates analyzing and portraying numeric data graphically, creating knowledge by combining data from multiple sources, extracting correlations between seemingly disparate variables, and recognizing numeric data trends. This system uses local natural language processing, mathematical analysis, and expert-based scientific heuristics to score the numeric and contextual relevancy of the data to the query parameters.
    • 用于数字数据的信息检索和分析系统,其提供数字搜索的高精度和调用,并且使用用于确定提取的数据的语境化的方法。 功能包括提取,解析和上下文数字数据,包括数字值和附带单位。 该系统便于将大量非结构化数字数据组织成反向索引和其他数据库格式。 一种信息检索系统,其能够探索和细化由搜索输入定义的提取的数字数据集,其可以是精确的或最初模糊的。 该系统还有助于以图形方式分析和描绘数字数据,通过组合来自多个来源的数据创建知识,提取看似不同的变量之间的相关性,并识别数字数据趋势。 该系统使用本地自然语言处理,数学分析和基于专家的科学启发式来对数据和查询参数的数据和上下文相关性进行评分。
    • 10. 发明授权
    • Generalized data mining and analytics apparatuses, methods and systems
    • 广义数据挖掘和分析设备,方法和系统
    • US09183203B1
    • 2015-11-10
    • US13252559
    • 2011-10-04
    • Ari TuchmanYaron GalantErich NachbarJohn StocktonKarthik Thiyagarajan
    • Ari TuchmanYaron GalantErich NachbarJohn StocktonKarthik Thiyagarajan
    • G06F17/30
    • G06F17/30011G06F17/3061G06F17/3064G06F17/30646G06F17/30651G06F17/3069G06F17/30696G06F2216/11
    • The GENERALIZED DATA MINING AND ANALYTICS APPARATUSES, METHODS AND SYSTEMS (“GDMA”), in various embodiments, may identify statistical relationships among query terms by analyzing a corpus of electronic documents. Inputs may be automatically generated automatically and/or user provided. In one embodiment, a method includes: accessing a term tensor associated with at least one term in a corpus of documents, wherein the term tensor comprises a plurality of data type vectors corresponding respectively to a plurality of term-correlated data types correlated with the at least one term in the corpus and each data type vector comprising a plurality of binned data type values with corresponding weighted occurrence values derived from the corpus; providing at least one of the plurality of term-correlated data types for selectable display; receiving at least one term-correlated data type selection; and providing data type values associated with the at least one term-correlated data type selection for display.
    • 在各种实施例中,通用数据挖掘和分析装置,方法和系统(“GDMA”)可以通过分析电子文档的语料库来识别查询词之间的统计关系。 输入可以自动自动生成和/或用户提供。 在一个实施例中,一种方法包括:访问与文档语料库中的至少一个项相关联的项张量,其中所述项张量包括分别对应于与所述文档相关联的多个项相关数据类型的多个数据类型向量 语料库中的至少一个项目和每个数据类型向量包括具有从语料库导出的对应加权出现值的多个合并数据类型值; 提供所述多个术语相关数据类型中的至少一个用于可选择显示; 接收至少一个术语相关数据类型选择; 以及提供与所述至少一个术语相关数据类型选择相关联的数据类型值以进行显示。