    • 4. Granted invention patent
    • System and method for extracting entities of interest from text using n-gram models
    • US07493293B2
    • 2009-02-17
    • US11421379
    • 2006-05-31
    • Tapas Kanungo; James J. Rhodes
    • Tapas Kanungo; James J. Rhodes
    • G06F15/18
    • G06F17/278
    • A document (or multiple documents) is analyzed to identify entities of interest within that document. This is accomplished by constructing n-gram or bi-gram models that correspond to different kinds of text entities, such as chemistry-related words and generic English words. The models can be constructed from training text selected to reflect a particular kind of text entity. The document is tokenized, and the tokens are run against the models to determine, for each token, which kind of text entity is most likely to be associated with that token. The entities of interest in the document can then be annotated accordingly.
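The abstract of US07493293B2 can be illustrated with a minimal Python sketch. It assumes character-level bi-gram models with add-one smoothing, which is one plausible reading of the abstract rather than the method claimed in the patent. The training word lists, the class name `CharBigramModel`, the function `annotate`, and the symbol-set size of 27 are all hypothetical choices made for illustration.

```python
import math
import re
from collections import Counter

# Toy training text (an assumption for illustration, not from the patent).
CHEMISTRY_WORDS = ["benzene", "toluene", "ethanol", "methanol",
                   "acetone", "propanol", "butane", "xylene"]
ENGLISH_WORDS = ["the", "and", "was", "with", "that", "this",
                 "from", "then", "were", "into", "mixed", "added"]


class CharBigramModel:
    """Character bi-gram model with add-one (Laplace) smoothing."""

    def __init__(self, training_words, symbol_count=27):
        # symbol_count: 26 letters plus the end marker (assumed alphabet).
        self.symbol_count = symbol_count
        self.pair_counts = Counter()
        self.context_counts = Counter()
        for word in training_words:
            padded = "^" + word.lower() + "$"
            for first, second in zip(padded, padded[1:]):
                self.pair_counts[(first, second)] += 1
                self.context_counts[first] += 1

    def log_prob(self, token):
        """Log-probability of the token's character sequence."""
        padded = "^" + token.lower() + "$"
        total = 0.0
        for first, second in zip(padded, padded[1:]):
            numerator = self.pair_counts[(first, second)] + 1
            denominator = self.context_counts[first] + self.symbol_count
            total += math.log(numerator / denominator)
        return total


def annotate(text, models):
    """Tokenize text and label each token with its most likely model."""
    tokens = re.findall(r"[A-Za-z]+", text)
    return [(token, max(models, key=lambda name: models[name].log_prob(token)))
            for token in tokens]


models = {
    "chemistry": CharBigramModel(CHEMISTRY_WORDS),
    "english": CharBigramModel(ENGLISH_WORDS),
}

for token, label in annotate("The ethanol was mixed with benzene", models):
    print(f"{token}\t{label}")
```

Each token is scored against every model and annotated with the label whose model assigns the highest likelihood. With training sets this small the labels are only meaningful for words that resemble the training text; a realistic setup would train on much larger corpora.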
    • 7. Granted invention patent
    • System, method, and service for using a focused random walk to produce samples on a topic from a collection of hyper-linked pages
    • US07640488B2
    • 2009-12-29
    • US11004412
    • 2004-12-04
    • Ziv Bar-Yossef; Tapas Kanungo; Robert Krauthgamer
    • Ziv Bar-Yossef; Tapas Kanungo; Robert Krauthgamer
    • G06F17/00; G06F17/20
    • G06F17/30864
    • A focused random walk system produces samples of on-topic pages from a collection of hyper-linked pages such as Web pages. The focused random walk system utilizes a focused random walk to produce a focused sample, which is a random sample of Web pages focused on a topic. The focused random walk system uniformly samples pages iteratively, where each iteration follows a random link from a union of the in-links and out-links of a page. The system then classifies this randomly selected link to determine whether the page is on-topic. The random walk sampling process could comprise a hard-focus method that selects only on-topic pages at each step of the focused random walk, or a soft-focus method that allows limited divergence to off-topic pages.
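The hard-focus and soft-focus variants described in the abstract of US07640488B2 can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the patented procedure: the link graph, the `is_on_topic` classifier, the function names, and the `max_off_topic` parameter are hypothetical, and the sketch makes no attempt to correct for page degree, so the samples are not guaranteed to be uniform.

```python
import random


def undirected_neighbors(out_links):
    """Union of in-links and out-links for every page."""
    neighbors = {page: set() for page in out_links}
    for page, targets in out_links.items():
        for target in targets:
            neighbors.setdefault(target, set())
            neighbors[page].add(target)   # out-link of page
            neighbors[target].add(page)   # in-link of target
    return neighbors


def focused_random_walk(out_links, is_on_topic, start, steps,
                        max_off_topic=0, seed=None):
    """Collect on-topic pages visited by a focused random walk.

    max_off_topic=0 gives a hard-focus walk (never moves off-topic).
    max_off_topic>0 gives a soft-focus walk that tolerates that many
    consecutive off-topic pages (an assumed way to limit divergence).
    """
    rng = random.Random(seed)
    neighbors = undirected_neighbors(out_links)
    current = start
    off_topic_run = 0
    samples = []
    for _ in range(steps):
        options = sorted(neighbors.get(current, ()))
        if not options:
            break  # isolated page: nowhere to walk
        candidate = rng.choice(options)
        if is_on_topic(candidate):
            current = candidate
            off_topic_run = 0
            samples.append(current)
        elif off_topic_run < max_off_topic:
            current = candidate
            off_topic_run += 1
        # Otherwise reject the move and stay on the current page.
    return samples


# Hypothetical link graph: page "d" is on-topic but reachable only via "x".
OUT_LINKS = {"a": ["b", "x"], "b": ["c"], "c": ["a"], "x": ["d"], "d": []}
ON_TOPIC = {"a", "b", "c", "d"}


def is_on_topic(page):
    return page in ON_TOPIC


hard = focused_random_walk(OUT_LINKS, is_on_topic, "a", steps=50, seed=1)
soft = focused_random_walk(OUT_LINKS, is_on_topic, "a", steps=50,
                           max_off_topic=1, seed=1)
print(sorted(set(hard)), sorted(set(soft)))
```

In this toy graph the hard-focus walk can never reach page "d", because the only route passes through the off-topic page "x". The soft-focus walk is allowed to step through "x", which is the kind of limited divergence the abstract describes.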