会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Using categorical metadata to rank search results
    • 使用分类元数据对搜索结果进行排名
    • US09020936B2
    • 2015-04-28
    • US12541166
    • 2009-08-14
    • Krysta Marie SvorePaul Nathan BennettSusan T. Dumais
    • Krysta Marie SvorePaul Nathan BennettSusan T. Dumais
    • G06F17/30
    • G06F17/30864G06F17/30696
    • A system that facilitates ranking search results returned by a search engine in response to receipt of a query is described herein. The system includes a receiver component that receives categorical metadata pertaining to an item and categorical metadata pertaining to the query and a computation component that computes at least one of a document feature pertaining to the item, a query feature pertaining to the query, or a document-query feature pertaining to the item and the query based at least in part upon one or more of the categorical metadata pertaining to the item or the categorical metadata pertaining to the query. The system also includes a ranker component that selectively places the item in a particular location in a sequence of items based at least in part upon the at least one of the document feature, the query feature, or the document-query feature.
    • 本文描述了有助于响应于接收查询而由搜索引擎返回的排序搜索结果的系统。 所述系统包括:接收器组件,其接收与项目有关的分类元数据和与所述查询有关的分类元数据;以及计算组件,其计算与所述项目相关的文档特征,与所述查询有关的查询特征,或者文档中的至少一个 至少部分地基于与所述项目有关的一个或多个所述分类元数据或与所述查询有关的所述分类元数据,所述关于所述项目和所述查询的查询特征。 系统还包括筛选器组件,其至少部分地基于文档特征,查询特征或文档查询特征中的至少一个,将项目选择性地放置在项目序列中的特定位置。
    • 2. 发明授权
    • Ranking based on social activity data
    • 基于社会活动数据排名
    • US08972399B2
    • 2015-03-03
    • US13531488
    • 2012-06-22
    • Paul Nathan BennettEmre Mehmet KicimanPeter Richard BaileyNikhil DandekarHuizhong Duan
    • Paul Nathan BennettEmre Mehmet KicimanPeter Richard BaileyNikhil DandekarHuizhong Duan
    • G06F17/30
    • G06F17/30867
    • Various technologies described herein pertain to using social activity data to personalize ranking of results returned by a computing operation for a user. For each of the results returned by the computing operation, a respective first affinity of the user to a corresponding result and a respective second affinity of the user to the corresponding result can be calculated and used for ranking the results. The respective first affinity of the user to the corresponding result can be calculated based on correlations between social activity data of the user and social activity data of a first group of historical users that clicked the corresponding result. Moreover, the respective second affinity of the user to the corresponding result can be calculated based on correlations between the social activity data of the user and social activity data of a second group of historical users that skipped the corresponding results.
    • 本文描述的各种技术涉及使用社交活动数据来个性化由用户计算操作返回的结果的排名。 对于通过计算操作返回的每个结果,可以计算用户对相应结果的相应第一亲和度和用户对相应结果的相应第二亲和度,并用于对结果进行排名。 可以基于用户的社交活动数据和点击相应结果的第一组历史用户的社交活动数据之间的相关性来计算用户对相应结果的第一亲和度。 此外,可以基于用户的社交活动数据和跳过相应结果的第二组历史用户的社交活动数据之间的相关性来计算用户对相应结果的相应第二亲和度。
    • 4. 发明授权
    • Probablistic models and methods for combining multiple content classifiers
    • 用于组合多个内容分类器的概念模型和方法
    • US07107254B1
    • 2006-09-12
    • US09850172
    • 2001-05-07
    • Susan T. DumaisEric J. HorvitzPaul Nathan Bennett
    • Susan T. DumaisEric J. HorvitzPaul Nathan Bennett
    • G06N5/02
    • G06N7/005
    • The invention applies a probabilistic approach to combining evidence regarding the correct classification of items. Training data and machine learning techniques are used to construct probabilistic dependency models that effectively utilize evidence. The evidence includes the outputs of one or more classifiers and optionally one or more reliability indicators. The reliability indicators are, in a broad sense, attributes of the items being classified. These attributes can include characteristics of an item, source of an item, and meta-level outputs of classifiers applied to the item. The resulting models include meta-classifiers, which combine evidence from two or more classifiers, and tuned classifiers, which use reliability indicators to inform the interpretation of classical classifier outputs. The invention also provides systems and methods for identifying new reliability indicators.
    • 本发明应用概率方法来合并关于项目正确分类的证据。 训练数据和机器学习技术用于构建有效利用证据的概率依赖模型。 证据包括一个或多个分类器的输出和可选的一个或多个可靠性指标。 在广义上,可靠性指标是被归类的物品的属性。 这些属性可以包括项目的特征,项目的来源以及应用于项目的分类器的元级输出。 所得到的模型包括组合来自两个或更多个分类器的证据的分类器和使用可靠性指标来通知经典分类器输出的解释的调谐分类器。 本发明还提供用于识别新的可靠性指标的系统和方法。
    • 5. 发明授权
    • Measuring duplication in search results
    • 测量搜索结果中的重复
    • US08825641B2
    • 2014-09-02
    • US12942553
    • 2010-11-09
    • Filip RadlinskiPaul Nathan BennettEmine Yilmaz
    • Filip RadlinskiPaul Nathan BennettEmine Yilmaz
    • G06F17/30
    • G06F17/30G06F17/30696G06F17/30864
    • Measuring duplication in search results is described. In one example, duplication between a pair of results provided by an information retrieval system in response to a query is measured. History data for the information retrieval system is accessed and query data retrieved, which describes the number of times that users have previously selected either or both of the pair of results, and a relative presentation sequence of the pair of results when displayed at each selection. From the query data, a fraction of user selections is determined in which a predefined combination of one or both of the pair of results were selected for a predefined presentation sequence. From the fraction, a measure of duplication between the pair of results is found. In further examples, the information retrieval system uses the measure of duplication to determine an overall redundancy value for a result set, and controls the result display accordingly.
    • 描述了搜索结果中的重复测量。 在一个示例中,测量由信息检索系统响应于查询提供的一对结果之间的重复。 访问信息检索系统的历史数据,并且检索查询数据,该数据描述了用户先前选择了该对结果中的一个或两个的次数,以及在每次选择时显示该对结果的相对呈现序列。 从查询数据中,确定一小部分用户选择,其中针对预定义的呈现序列选择了一对或两者的预定组合。 从分数中可以发现一对结果之间的重复度。 在其他示例中,信息检索系统使用复制度来确定结果集的总体冗余度值,并相应地控制结果显示。
    • 6. 发明申请
    • RANKING SEARCH RESULTS USING RESULT REPETITION
    • 使用结果重复排列搜索结果
    • US20130246412A1
    • 2013-09-19
    • US13420591
    • 2012-03-14
    • Milad ShokouhiRyen William WhitePaul Nathan Bennett
    • Milad ShokouhiRyen William WhitePaul Nathan Bennett
    • G06F17/30
    • G06F17/30867
    • Ranking search results using result repetition is described. In an embodiment, a set of results generated by a search engine is ranked or re-ranked based on whether any of the results were included in previous sets of results generated in response to earlier queries by the same user in one or more searching sessions. User behavior data, such as whether a user clicks on a result, skips a result or misses a result, is stored in real-time and the stored data is used in performing the ranking. In various examples, the ranking is performed using a machine-learning algorithm and various parameters, such as whether a result in a current set of results has previously been clicked, skipped or missed in the same session, are generated based on the user behavior data for the current session and input to the machine-learning algorithm.
    • 描述使用结果重复排列搜索结果。 在一个实施例中,基于是否将任何结果包括在响应于同一用户在一个或多个搜索会话中的较早查询而生成的先前的结果集合中来对搜索引擎生成的一组结果进行排名或重新排序。 实时存储用户行为数据,例如用户是否点击结果,跳过结果或丢失结果,并且存储的数据用于执行排名。 在各种示例中,使用机器学习算法执行排名,并且基于用户行为数据生成各种参数,诸如是否先前在同一会话中点击,跳过或丢失了当前结果集中的结果 为当前会话并输入到机器学习算法。
    • 7. 发明授权
    • Structured cross-lingual relevance feedback for enhancing search results
    • 结构化的跨语言相关性反馈,以增强搜索结果
    • US08645289B2
    • 2014-02-04
    • US12970879
    • 2010-12-16
    • Paul Nathan BennettJianfeng GaoJagadeesh JagarlamudiKristen Patricia Parton
    • Paul Nathan BennettJianfeng GaoJagadeesh JagarlamudiKristen Patricia Parton
    • G06F15/18
    • G06F17/30669G06F17/30675
    • A “Cross-Lingual Unified Relevance Model” provides a feedback model that improves a machine-learned ranker for a language with few training resources, using feedback from a more complete ranker for a language that has more training resources. The model focuses on linguistically non-local queries, such as “world cup” (English language/U.S. market) and “copa mundial” (Spanish language/Mexican market), that have similar user intent in different languages and markets or regions, thus allowing the low-resource ranker to receive direct relevance feedback from the high-resource ranker. Among other things, the Cross-Lingual Unified Relevance Model differs from conventional relevancy-based techniques by incorporating both query- and document-level features. More specifically, the Cross-Lingual Unified Relevance Model generalizes existing cross-lingual feedback models, incorporating both query expansion and document re-ranking to further amplify the signal from the high-resource ranker to enable a learning to rank approach based on appropriately labeled training data.
    • “跨语言统一相关性模型”提供了一种反馈模型,可以为少数培训资源的语言改进机器学习游戏者,使用更完整的游戏者的反馈来获得具有更多培训资源的语言。 该模式侧重于语言上的非本地查询,例如“世界杯”(英语/美国市场)和“复合世界”(西班牙语/墨西哥市场),在不同语言和市场或区域具有类似的用户意图,因此 允许低资源游击队员接收来自高资源队员的直接相关反馈。 其中,跨语言统一相关性模型与传统的相关性技术不同,包括查询和文档级功能。 更具体地说,跨语言统一相关性模型概括了现有的跨语言反馈模型,其中包括查询扩展和文档重新排序,以进一步放大来自高资源游戏者的信号,以使学习能够基于适当标记的训练进行排名 数据。
    • 9. 发明申请
    • Measuring Duplication in Search Results
    • 测量搜索结果中的重复
    • US20120117043A1
    • 2012-05-10
    • US12942553
    • 2010-11-09
    • Filip RadlinskiPaul Nathan BennettEmine Yilmaz
    • Filip RadlinskiPaul Nathan BennettEmine Yilmaz
    • G06F17/30
    • G06F17/30G06F17/30696G06F17/30864
    • Measuring duplication in search results is described. In one example, duplication between a pair of results provided by an information retrieval system in response to a query is measured. History data for the information retrieval system is accessed and query data retrieved, which describes the number of times that users have previously selected either or both of the pair of results, and a relative presentation sequence of the pair of results when displayed at each selection. From the query data, a fraction of user selections is determined in which a predefined combination of one or both of the pair of results were selected for a predefined presentation sequence. From the fraction, a measure of duplication between the pair of results is found. In further examples, the information retrieval system uses the measure of duplication to determine an overall redundancy value for a result set, and controls the result display accordingly.
    • 描述了搜索结果中的重复测量。 在一个示例中,测量由信息检索系统响应于查询提供的一对结果之间的重复。 访问信息检索系统的历史数据,并且检索查询数据,该数据描述了用户先前选择了该对结果中的一个或两个的次数,以及在每次选择时显示该对结果的相对呈现序列。 从查询数据中,确定一小部分用户选择,其中针对预定义的呈现序列选择了一对或两者的预定组合。 从分数中可以发现一对结果之间的重复度。 在其他示例中,信息检索系统使用复制度来确定结果集的总体冗余度值,并相应地控制结果显示。
    • 10. 发明申请
    • USING CATEGORICAL METADATA TO RANK SEARCH RESULTS
    • 使用分类元数据来排列搜索结果
    • US20110040752A1
    • 2011-02-17
    • US12541166
    • 2009-08-14
    • Krysta Marie SvorePaul Nathan BennettSusan T. Dumais
    • Krysta Marie SvorePaul Nathan BennettSusan T. Dumais
    • G06F17/30
    • G06F17/30864G06F17/30696
    • A system that facilitates ranking search results returned by a search engine in response to receipt of a query is described herein. The system includes a receiver component that receives categorical metadata pertaining to an item and categorical metadata pertaining to the query and a computation component that computes at least one of a document feature pertaining to the item, a query feature pertaining to the query, or a document-query feature pertaining to the item and the query based at least in part upon one or more of the categorical metadata pertaining to the item or the categorical metadata pertaining to the query. The system also includes a ranker component that selectively places the item in a particular location in a sequence of items based at least in part upon the at least one of the document feature, the query feature, or the document-query feature.
    • 本文描述了有助于响应于接收查询而由搜索引擎返回的排序搜索结果的系统。 所述系统包括:接收器组件,其接收与项目有关的分类元数据和与所述查询有关的分类元数据;以及计算组件,其计算与所述项目相关的文档特征,与所述查询有关的查询特征,或者文档中的至少一个 至少部分地基于与所述项目有关的一个或多个所述分类元数据或与所述查询有关的所述分类元数据,所述关于所述项目和所述查询的查询特征。 系统还包括筛选器组件,其至少部分地基于文档特征,查询特征或文档查询特征中的至少一个,将项目选择性地放置在项目序列中的特定位置。