会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • EXTRACTION OF CERTAIN TYPES OF ENTITIES
    • 提取某些类型的实体
    • US20110131244A1
    • 2011-06-02
    • US12626905
    • 2009-11-29
    • Amir J. PadovitzMatthew F. Hurst
    • Amir J. PadovitzMatthew F. Hurst
    • G06F17/30G06F15/18
    • G06F16/367G06F16/355
    • Certain types of entities may be extracted from a document. In one example, the entities to be recognized are cultural entities, such as the names of movies, video games, books, etc. For each such entity, a concept graph may be built that shows the relationship between the entity itself and other entities, such as the relationship between a movie and the actor(s) who act in the movie. When a candidate entity name is detected in the document, the concept graph may be used to look for other entities that appear in the context of the candidate entity. The presence of related entities in the context of the candidate may be used to disambiguate the meaning of the candidate. For example, a common word like “up” might be recognized as the name of a movie if the names of actors or characters in that movie appear near the word “up”.
    • 可以从文档中提取某些类型的实体。 在一个示例中,要被识别的实体是文化实体,诸如电影,视频游戏,书籍等的名称。对于每个这样的实体,可以构建示出实体本身和其他实体之间的关系的概念图, 例如电影和在电影中扮演的演员之间的关系。 当在文档中检测到候选实体名称时,概念图可以用于查找出现在候选实体的上下文中的其他实体。 在候选人的上下文中存在相关实体可以用来消除候选人的意思。 例如,如果该电影中的演员或角色的名字出现在“up”字样附近,则可能将诸如“up”的常用单词识别为电影的名称。
    • 9. 发明授权
    • Large scale item representation matching
    • 大型项目表示匹配
    • US07818278B2
    • 2010-10-19
    • US11763200
    • 2007-06-14
    • Amir J. PadovitzDima SuponauWei YuMikhail Bilenko
    • Amir J. PadovitzDima SuponauWei YuMikhail Bilenko
    • G06N5/02
    • G06F17/30542G06F17/30489G06K9/6857Y10S707/917
    • A two-phase process quickly and accurately identifies representations of the same items within a collection of item representations. In the first phase, referred to as a “blocking phase,” frequency information indicating the frequency with which terms appear within the collection of item representations is used to quickly identify “candidate pairs” (i.e., pairs of item representations that have a relatively high probability of matching). The blocking phase results in a reduced subset of the data for further analysis during the second phase. In the second phase, referred to as a “matching phase,” the candidate pairs are analyzed using fuzzy matching functions to accurately identify “matching pairs” (i.e., representations of the same items).
    • 两阶段过程快速准确地识别项目表示集合中相同项目的表示。 在第一阶段中,被称为“阻塞阶段”的频率信息被用于快速识别“候选对”(即,具有相对较高的项目表示的对) 匹配的概率)。 阻塞阶段导致在第二阶段期间用于进一步分析的数据的子集减少。 在称为“匹配阶段”的第二阶段中,使用模糊匹配函数分析候选对以精确地识别“匹配对”(即,相同项目的表示)。