Quick Patent Search - Fast search of global patents, free commercial patent database - IPRDB

1. Granted invention patent

US08538898B2 Interactive framework for name disambiguation (Active)
Publication No.: US08538898B2
Publication date: 2013-09-17
Application No.: US13118404
Filing date: 2011-05-28
Applicant: Zhengdong Lu, Zaiqing Nie, Gang Luo, Yong Cao, Ji-Rong Wen, Wei-Ying Ma
Inventors: Zhengdong Lu, Zaiqing Nie, Gang Luo, Yong Cao, Ji-Rong Wen, Wei-Ying Ma
IPC classification: G06N5/00
CPC classification: G06N99/005, G06F17/30616
Abstract: A “Name Disambiguator” provides various techniques for implementing an interactive framework for resolving or disambiguating entity names (associated with objects such as publications) for entity searches where two or more same or similar names may refer to different entities. More specifically, the Name Disambiguator uses a combination of user input and automatic models to address the disambiguation problem. In various embodiments, the Name Disambiguator uses a two-part process, including: 1) a global SVM trained from large sets of documents or objects in a simulated interactive mode, and 2) further personalization of local SVM models (associated with individual names or groups of names such as, for example, a group of coauthors) derived from the global SVM model. The result of this process is that large sets of documents or objects are rapidly and accurately condensed or clustered into ordered sets that are organized by entity names.

2. Granted invention patent

US07720830B2 Hierarchical conditional random fields for web extraction (Lapsed)
Publication No.: US07720830B2
Publication date: 2010-05-18
Application No.: US11461400
Filing date: 2006-07-31
Applicant: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
Inventors: Ji-Rong Wen, Wei-Ying Ma, Zaiqing Nie, Jun Zhu
IPC classification: G06F7/00, G06F17/30, G06F17/00, G06F15/173
CPC classification: G06F17/3089, G06F17/30994
Abstract: A method and system for labeling object information of an information page is provided. A labeling system identifies an object record of an information page based on the labeling of object elements within an object record and labels object elements based on the identification of an object record that contains the object elements. To identify the records and label the elements, the labeling system generates a hierarchical representation of blocks of an information page. The labeling system identifies records and elements within the records by propagating probability-related information of record labels and element labels through the hierarchy of the blocks. The labeling system generates a feature vector for each block to represent the block and calculates a probability of a label for a block being correct based on a score derived from the feature vectors associated with related blocks. The labeling system searches for the labeling of records and elements that has the highest probability of being correct.

3. Granted invention patent

US07529748B2 Information classification paradigm (Active)
Publication No.: US07529748B2
Publication date: 2009-05-05
Application No.: US11276818
Filing date: 2006-03-15
Applicant: Ji-Rong Wen, Yan-Feng Sun, Wei-Ying Ma, Zaiqing Nie, Renkuan Jiang
Inventors: Ji-Rong Wen, Yan-Feng Sun, Wei-Ying Ma, Zaiqing Nie, Renkuan Jiang
IPC classification: G06F17/30
CPC classification: G06F17/30707, Y10S707/99933, Y10S707/99937
Abstract: A mechanism is described for classifying source documents into one of two categories: likely to contain desired information or unlikely to contain desired information. Generally, some form of rules-based classification is used in conjunction with deeper analysis that applies advanced techniques to difficult cases. The rules-based classification is generally good for eliminating cases from further consideration and for identifying documents of interest based on generally discernible relationships between data or based on the presence or absence of data. The deeper analysis is used to uncover more complex relationships between data that may identify documents of interest. Portions of the process may use the entire document while other portions of the process may use only a portion of the document.

4. Invention patent application

US20080215563A1 Pseudo-Anchor Text Extraction for Vertical Search (Lapsed)
Publication No.: US20080215563A1
Publication date: 2008-09-04
Application No.: US11681682
Filing date: 2007-03-02
Applicant: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
Inventors: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
IPC classification: G06F17/30
CPC classification: G06F17/30616, G06F17/30864, Y10S707/99932
Abstract: A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help rank the objects in a search result to improve search performance. The method may be used in vertical search of objects such as published articles, products and images that lack explicit URL and anchor text information.

5. Granted invention patent

US09990429B2 Automated social networking graph mining and visualization (Active)
Publication No.: US09990429B2
Publication date: 2018-06-05
Application No.: US12780522
Filing date: 2010-05-14
Applicant: Zaiqing Nie, Yong Cao, Gang Luo, Ruochi Zhang, Xiaojiang Liu, Yunxiao Ma, Bo Zhang, Ying-Qing Xu, Ji-Rong Wen
Inventors: Zaiqing Nie, Yong Cao, Gang Luo, Ruochi Zhang, Xiaojiang Liu, Yunxiao Ma, Bo Zhang, Ying-Qing Xu, Ji-Rong Wen
IPC classification: G06F3/0481, G06F3/0482, G06F3/0483, G06F8/38, G06F17/30
CPC classification: G06F17/30867
Abstract: The automated social networking graph mining and visualization technique described herein mines social connections and allows creation of a social networking graph from general (not necessarily social-application specific) Web pages. The technique uses the distances between a person's/entity's name and related people's/entities' names on one or more Web pages to determine connections between people/entities and the strengths of the connections. In one embodiment, the technique lays out these connections, and then clusters them, in a 2-D layout of a social networking graph that represents the Web connection strengths among the related people's or entities' names, by using a force-directed model.
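
The distance-based idea in the abstract above can be illustrated with a minimal Python sketch. It is not the patented method: the function name `connection_strengths`, the use of character offsets as the distance measure, the `max_distance` cutoff, and the `1 / (1 + distance)` weighting are all assumptions chosen for illustration, and the force-directed layout step is not covered.

```python
import re
from collections import defaultdict
from itertools import combinations


def connection_strengths(pages, names, max_distance=50):
    """Score each pair of names by how close their mentions are on each page.

    Hypothetical scoring: every pair of mentions within `max_distance`
    characters adds 1 / (1 + distance) to the pair's strength.
    """
    strengths = defaultdict(float)
    for text in pages:
        # Character offset of every occurrence of every name on this page.
        positions = {
            name: [m.start() for m in re.finditer(re.escape(name), text)]
            for name in names
        }
        for a, b in combinations(sorted(names), 2):
            for pa in positions[a]:
                for pb in positions[b]:
                    distance = abs(pa - pb)
                    if distance <= max_distance:
                        # Closer mentions contribute more to the strength.
                        strengths[(a, b)] += 1.0 / (1.0 + distance)
    return dict(strengths)
```

Pairs whose mentions never fall within the cutoff get no entry at all, so the returned dictionary can be read directly as a weighted edge list for a graph.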

6. Granted invention patent

US09305083B2 Author disambiguation (Active)
Publication No.: US09305083B2
Publication date: 2016-04-05
Application No.: US13358884
Filing date: 2012-01-26
Applicant: Yunhua Hu, Zaiqing Nie
Inventors: Yunhua Hu, Zaiqing Nie
IPC classification: G06F17/30
CPC classification: G06F17/30705, G06F17/30864
Abstract: The techniques described herein automatically generate high precision clusters and high recall clusters for a set of documents having an author with a same or similar name. The high precision clusters and the high recall clusters can then be used in a labeling process so that efficient and accurate author disambiguation is realized.

7. Invention patent application

US20130198192A1 AUTHOR DISAMBIGUATION (Active)
Publication No.: US20130198192A1
Publication date: 2013-08-01
Application No.: US13358884
Filing date: 2012-01-26
Applicant: Yunhua Hu, Zaiqing Nie
Inventors: Yunhua Hu, Zaiqing Nie
IPC classification: G06F17/30
CPC classification: G06F17/30705, G06F17/30864
Abstract: The techniques described herein automatically generate high precision clusters and high recall clusters for a set of documents having an author with a same or similar name. The high precision clusters and the high recall clusters can then be used in a labeling process so that efficient and accurate author disambiguation is realized.

8. Granted invention patent

US08229960B2 Web-scale entity summarization (Active)
Publication No.: US08229960B2
Publication date: 2012-07-24
Application No.: US12570023
Filing date: 2009-09-30
Applicant: Zaiqing Nie, Ji-Rong Wen, Liu Yang
Inventors: Zaiqing Nie, Ji-Rong Wen, Liu Yang
IPC classification: G06F17/00
CPC classification: G06F17/30867
Abstract: Described is a technique for summarizing a web entity (e.g., a person, place, product or so forth) based upon the entity's appearance in web documents (e.g., on the order of hundreds of millions or billions of webpages). Webpages are separated into blocks, which are then processed according to various features to reduce the number of blocks that need further processing and to rank the remaining blocks by relevance to the entity. A redundancy removal mechanism removes redundant blocks, leaving a set of remaining blocks that are used to provide a summary of information that is relevant to the entity.
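
The redundancy-removal step mentioned in the abstract above can be sketched in Python under stated assumptions. The abstract does not say how redundancy is measured, so the token-set Jaccard similarity, the `threshold` and `max_blocks` parameters, and the function names `jaccard` and `summarize` are hypothetical. The input is assumed to be a list of text blocks already ranked by relevance to the entity.

```python
def jaccard(a, b):
    """Jaccard similarity between two sets of tokens."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def summarize(ranked_blocks, max_blocks=3, threshold=0.6):
    """Greedily keep top-ranked blocks that are not near-duplicates.

    A block is treated as redundant (an assumption of this sketch) when
    its token-set Jaccard similarity to any kept block is >= threshold.
    """
    kept = []
    kept_tokens = []
    for block in ranked_blocks:
        tokens = set(block.lower().split())
        if any(jaccard(tokens, seen) >= threshold for seen in kept_tokens):
            continue  # Too similar to a block already in the summary.
        kept.append(block)
        kept_tokens.append(tokens)
        if len(kept) == max_blocks:
            break
    return kept
```

Because the blocks are visited in rank order, the higher-ranked member of any near-duplicate group is the one that survives.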

9. Granted invention patent

US07831685B2 Automatic detection of online commercial intention (Lapsed)
Publication No.: US07831685B2
Publication date: 2010-11-09
Application No.: US11300748
Filing date: 2005-12-14
Applicant: Honghua Dai, Lee Wang, Ying Li, Zaiqing Nie, Ji-Rong Wen, Lingzhi Zhao
Inventors: Honghua Dai, Lee Wang, Ying Li, Zaiqing Nie, Ji-Rong Wen, Lingzhi Zhao
IPC classification: G06F15/16, G06Q30/00
CPC classification: G06Q30/02
Abstract: Features extracted from network browser pages and/or network search queries are leveraged to help detect a user's browsing and/or searching intent. Machine learning classifiers constructed from these features automatically detect a user's online commercial intention (OCI). A user's intention can be commercial or non-commercial, with commercial intentions being informational or transactional. In one instance, an OCI ranking mechanism is employed with a search engine to help provide search results that are ranked according to a user's intention. This also provides a means to match purchasing advertisements with potential customers who are more than likely ready to make a purchase (transactional stage). Additionally, informational advertisements can be matched to users who are researching a potential purchase (informational stage).

10. Invention patent application

US20100145956A1 PSEUDO-ANCHOR TEXT EXTRACTION (Active)
Publication No.: US20100145956A1
Publication date: 2010-06-10
Application No.: US12697056
Filing date: 2010-01-29
Applicant: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
Inventors: Shuming Shi, Zaiqing Nie, Ji-Rong Wen, Mingjie Zhu, Fei Xing
IPC classification: G06F17/30
CPC classification: G06F17/30616, G06F17/30864, Y10S707/99932
Abstract: A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help rank the objects in a search result to improve search performance. The method may be used in vertical search of objects such as published articles, products and images that lack explicit URLs and anchor text information.
