    • 3. Invention patent
    • Generating a topic-based summary of textual content
    • GB2573189A
    • 2019-10-30
    • GB201901522
    • 2019-02-04
    • ADOBE INC
    • KUNDAN KRISHNA; BALAJI VASAN SRINIVASAN
    • G06F16/34; G06F40/00
    • A method for generating a summary of textual content tuned to a specific topic. A topic-aware encoding model encodes the text (804) together with a topic label (e.g. a one-hot vector) to produce topic-aware encoded text. A word generation model then selects the next word (808) of the summary from the encoded text; this model is trained by machine learning on training data comprising documents with corresponding summaries, each associated with a topic. The selected next word is fed back (810) to the word generation model. Also disclosed is a method for training the encoding and word generation models. An intermediate dataset is obtained comprising documents, each with a summary and an associated topic. Training data is generated by merging the text of a first and a second document into a new document associated with the summary and topic of the first document, then merging their text again into another new document associated with the summary and topic of the second document. The original documents are discarded, and the process repeats until the intermediate dataset is exhausted. The resulting training data is used to train the encoding and word generation models.
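
The training-data generation step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how two texts are merged or how document pairs are chosen, so the following are assumptions made only for illustration: `merge_texts` randomly interleaves sentences while keeping each document's own sentence order, pairs are drawn from a shuffled pool, and a leftover unpaired document is dropped. The names `Example`, `merge_texts`, and `build_training_data` are hypothetical and do not come from the patent.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Example:
    text: str      # document body
    summary: str   # reference summary
    topic: str     # topic label associated with the summary


def merge_texts(first: str, second: str, rng: random.Random) -> str:
    # Assumption: "merging" means randomly interleaving the sentences of
    # both documents while preserving each document's internal order.
    a = [s.strip() for s in first.split(".") if s.strip()]
    b = [s.strip() for s in second.split(".") if s.strip()]
    merged = []
    i = j = 0
    while i < len(a) or j < len(b):
        # Take from `a` if `b` is used up, or on a coin flip while both remain.
        take_first = j >= len(b) or (i < len(a) and rng.random() < 0.5)
        if take_first:
            merged.append(a[i])
            i += 1
        else:
            merged.append(b[j])
            j += 1
    return ". ".join(merged) + "." if merged else ""


def build_training_data(intermediate, seed=0):
    rng = random.Random(seed)
    pool = list(intermediate)
    rng.shuffle(pool)  # assumption: pairs are chosen at random
    training = []
    # Repeat until the intermediate set is exhausted (fewer than two left).
    while len(pool) >= 2:
        first = pool.pop()   # originals are removed from the pool (discarded)
        second = pool.pop()
        # Same two source texts, merged twice, with different targets.
        training.append(Example(merge_texts(first.text, second.text, rng),
                                first.summary, first.topic))
        training.append(Example(merge_texts(first.text, second.text, rng),
                                second.summary, second.topic))
    return training
```

Each pair of source documents yields two training examples whose inputs contain the same sentences but whose target summary and topic differ, which is what would push a model to rely on the topic label when deciding which content to summarize.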