JP7807460B2

JP7807460B2 - Improved virtual meeting user experience based on augmented intelligence

Info

Publication number: JP7807460B2
Application number: JP2023558790A
Authority: JP
Inventors: ソード、シッダールタ; ジェイン、サンケット; クマールシャーマ、アジャイ; カフィーウディン、ムハンマド; クマールジョシ、マノジュ
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 2021-04-15
Filing date: 2022-03-07
Publication date: 2026-01-27
Anticipated expiration: 2042-03-07
Also published as: CN117157943A; JP2024514062A; US11764985B2; WO2022218062A1; US20220337443A1

Description

本発明は、一般に、データ処理の分野に関し、より詳細には、仮想会議ユーザ・エクスペリエンスを改善するための拡張インテリジェンスに基づくシステムおよび方法に関する。 The present invention relates generally to the field of data processing, and more particularly to a system and method based on augmented intelligence for improving virtual meeting user experience.

拡張インテリジェンスは、人間の能力を高めることにおけるＡＩの支援の役割に重点を置く人工知能（ＡＩ：artificial intelligence）の代替の概念化である。拡張インテリジェンスは、機械学習および深層学習を使用して、人間のインテリジェンスを置き換えるのではなく、強化する。拡張インテリジェンスは、人間の意思決定を改善することによって、人間のインテリジェンスを強化し、延いては、改善された決定に応答して行動が取られる。 Augmented intelligence is an alternative conceptualization of artificial intelligence (AI) that emphasizes the supporting role of AI in enhancing human capabilities. Augmented intelligence uses machine learning and deep learning to enhance, rather than replace, human intelligence. Augmented intelligence enhances human intelligence by improving human decision-making, which in turn allows actions to be taken in response to the improved decisions.

機械学習は、追加のプログラミングなしで経験から学習して改善するＡＩの能力を表す。コンピュータが人間の言語を識別できるようにする自然言語処理は、機械学習の例である。深層学習は、データを処理してパターンを調べる人間の脳の能力を模倣するＡＩのプロセスを表す。 Machine learning describes AI's ability to learn and improve from experience without additional programming. Natural language processing, which enables computers to identify human language, is an example of machine learning. Deep learning describes AI processes that mimic the human brain's ability to process data and examine patterns.

拡張インテリジェンスとＡＩの間の重要な違いは、自律性の違いである。拡張インテリジェンスは、他の方法では人間の意思決定者を圧倒する大量のデータを処理し、先入観、疲労、または注意散漫などの、データがゆがめられるか、または間違って解釈されることを引き起こす要因を取り除く。拡張インテリジェンスは、データを分析し、パターンを識別し、それらのパターンをユーザに報告し、その後、人間のインテリジェンスが引き継ぐことができるようにする。拡張インテリジェンスは、システムがユーザの個別の嗜好および期待についてユーザから学習することを可能にし、好みに合わせて調整されて個人化された経験を提供する。 A key difference between augmented intelligence and AI is the difference in autonomy. Augmented intelligence processes large amounts of data that would otherwise overwhelm a human decision maker, eliminating factors such as bias, fatigue, or distraction that would cause the data to be distorted or misinterpreted. Augmented intelligence analyzes the data, identifies patterns, and reports those patterns to the user, after which human intelligence can take over. Augmented intelligence enables the system to learn from the user about their individual preferences and expectations, providing a personalized experience tailored to their preferences.

拡張インテリジェンスの例は、ストリーミング・ビデオ・サービスによって提供された推奨の視聴である。拡張インテリジェンス・アルゴリズムは、ユーザの視聴習慣を分析し、それらの習慣に基づいて追加の視聴を推奨する。その後、推奨に従って行動するかどうかの判断は、ユーザに任される。拡張インテリジェンスには、任意の産業においてパターンおよび予測指標についてビッグ・データをマイニングする応用もある。例としては、顧客の嗜好を予測するためにデータ分析を使用するオンライン・ストア、自然言語処理に基づく仮想的顧客サービス支援、浮動票を有する人々を識別するためにビッグ・データ分析を使用する政治シンク・タンク、効率的な治療選択肢を識別するためのケース・ファイルの医学分析、人間の従業員によって監視される工場自動化、過去のデータに基づく工場の機器の予知保全、周囲の環境およびデータを使用して拡張現実のイベントを作成し、コンピュータによって生成された画像をスマートフォンのカメラ画面に重ね合わせる、モバイル・ビデオ・ゲーム、飛行機およびドローンのオートパイロット・システム、株式市場のパターンを監視および識別する投資および金融アプリケーションが挙げられるが、これらに限定されない。 An example of augmented intelligence is viewing recommendations provided by a streaming video service. Augmented intelligence algorithms analyze a user's viewing habits and recommend additional viewings based on those habits. The user is then left to decide whether to act on the recommendations. Augmented intelligence also has applications in mining big data for patterns and predictive indicators in any industry. Examples include, but are not limited to, online stores using data analytics to predict customer preferences, virtual customer service assistants based on natural language processing, political think tanks using big data analytics to identify swing voters, medical analysis of case files to identify efficient treatment options, factory automation overseen by human employees, predictive maintenance of factory equipment based on historical data, mobile video games that use the surrounding environment and data to create augmented reality events and overlay computer-generated images on a smartphone camera screen, autopilot systems for airplanes and drones, and investment and finance applications that monitor and identify stock market patterns.

一方、ＡＩは、どのような人間の支援もなしで動作する。ＡＩの例は、電子メール・スパム・フィルタまたはＡＩを利用する検索提案である。 AI, on the other hand, operates without any human assistance. Examples of AI are email spam filters or AI-powered search suggestions.

本発明の実施形態の態様は、仮想会議ユーザ・エクスペリエンスを改善するための方法、コンピュータ・プログラム製品、およびコンピュータ・システムを開示する。プロセッサは、事前設定された期間の間、または事前にスケジュールされた仮想会議の割り当てられた合計時間の事前設定された割合の間、少なくとも２人以上の参加者を含んでいる仮想会議から離れているユーザを検出する。プロセッサは、ユーザに関するデータの第１のセット、仮想会議の少なくとも２人の参加者に関するデータの第２のセット、およびユーザと仮想会議の少なくとも２人の参加者の間の関係に関するデータの第３のセットをデータベースから取り出す。プロセッサは、ユーザのプロフィールに合わせて調整された、ユーザが切り離されていた間の仮想会議の一部を対象にする、要約を準備する。プロセッサは、ユーザが仮想会議に再接続することを検出する。プロセッサは、事前設定されたユーザの嗜好、ユーザによって行われた決定、または機械駆動の推奨に基づいて、ユーザが仮想会議に再び参加する前に要約をレビュー(review)するかどうかを判定する。ユーザが仮想会議に再び参加する前に要約をレビューするということを決定することに応答して、プロセッサは、デフォルトのユーザの嗜好のセットを使用して、要約をレビューするようユーザに促す。プロセッサは、要約をユーザに出力する。 Aspects of an embodiment of the present invention disclose a method, computer program product, and computer system for improving a virtual conference user experience. A processor detects a user who has been absent from a virtual conference including at least two or more participants for a preset period of time or a preset percentage of the pre-scheduled virtual conference's allotted total time. The processor retrieves from a database a first set of data regarding the user, a second set of data regarding at least two participants of the virtual conference, and a third set of data regarding relationships between the user and the at least two participants of the virtual conference. The processor prepares a summary tailored to the user's profile and covering the portion of the virtual conference during which the user was absent. The processor detects that the user reconnects to the virtual conference. The processor determines whether the user will review the summary before rejoining the virtual conference based on preset user preferences, a user-made decision, or a machine-driven recommendation. In response to determining that the user will review the summary before rejoining the virtual conference, the processor prompts the user to review the summary using a default set of user preferences. The processor outputs the summary to the user.

本発明の実施形態の一部の態様では、要約をユーザに出力した後に、プロセッサは、要約を仮想会議の完全な記録の再生と比較する。プロセッサは、ユーザに対してフィードバックを要求する。プロセッサは、ユーザからフィードバックを受信する。プロセッサは、強化学習を使用して、準備される複数の未来の要約の精度を改善する。プロセッサは、ユーザからのフィードバックをデータベースに格納する。 In some aspects of embodiments of the present invention, after outputting the summary to the user, the processor compares the summary to a playback of the full recording of the virtual meeting. The processor requests feedback from the user. The processor receives feedback from the user. The processor uses reinforcement learning to improve the accuracy of multiple future summaries that are prepared. The processor stores the user feedback in a database.

本発明の実施形態の一部の態様では、プロセッサは、仮想会議の２人以上の参加者を識別する。プロセッサは、要因の第１のセットに基づいて、重み付けを仮想会議の２人以上の参加者に割り当てる。プロセッサは、２人以上の参加者に割り当てられた重み付けに基づいて、仮想会議の２人以上の参加者をランク付けする。 In some aspects of embodiments of the present invention, a processor identifies two or more participants in a virtual conference. The processor assigns weights to the two or more participants in the virtual conference based on a first set of factors. The processor ranks the two or more participants in the virtual conference based on the weights assigned to the two or more participants.

本発明の実施形態の一部の態様では、プロセッサは、少なくとも２人の参加者に割り当てられた重み付けに基づいて仮想会議の少なくとも２人の参加者をランク付けした後に、複数の音声フレーム、複数のビデオ・フレーム、ならびに複数の音声およびビデオ・フレームを、ユーザが切り離されていた間の仮想会議の一部から抽出する。プロセッサは、仮想会議の１つまたは複数のトピックを理解するために、複数の音声フレーム、複数のビデオ・フレーム、ならびに複数の音声およびビデオ・フレームの文脈を識別する。 In some aspects of embodiments of the present invention, a processor ranks at least two participants of the virtual conference based on weights assigned to the at least two participants, and then extracts a plurality of audio frames, a plurality of video frames, and a plurality of audio and video frames from a portion of the virtual conference during which the users were disconnected. The processor identifies context for the plurality of audio frames, the plurality of video frames, and the plurality of audio and video frames to understand one or more topics of the virtual conference.

本発明の実施形態の一部の態様では、プロセッサは、仮想会議の１つまたは複数のトピックを理解するために複数の音声フレーム、複数のビデオ・フレーム、ならびに複数の音声およびビデオ・フレームの文脈を識別した後に、複数の音声フレーム、複数のビデオ・フレーム、ならびに複数の音声およびビデオ・フレームがユーザのプロフィールに関連するかどうかに基づいて、複数の音声フレームのサブセット、複数のビデオ・フレームのサブセット、ならびに複数の音声およびビデオ・フレームのサブセットを選択する。プロセッサは、アルゴリズム法または要因の第２のセットを使用して、複数の音声フレームのサブセット、複数のビデオ・フレームのサブセット、ならびに複数の音声およびビデオ・フレームのサブセットをランク付けする。プロセッサは、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声フレームのサブセット、複数のビデオ・フレームのサブセット、ならびに複数の音声およびビデオ・フレームのサブセットをマージする。プロセッサは、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声フレームのサブセットの統合された要約、アルゴリズムで決定されたしきい値より上にランク付けされた複数のビデオ・フレームのサブセットの統合された要約、ならびにアルゴリズムで決定されたしきい値より上にランク付けされた複数の音声およびビデオ・フレームのサブセットの統合された要約を準備する。 In some aspects of embodiments of the present invention, after identifying the context of the plurality of audio frames, the plurality of video frames, and the plurality of audio and video frames to understand one or more topics of the virtual conference, the processor selects a subset of the plurality of audio frames, a subset of the plurality of video frames, and a subset of the plurality of audio and video frames based on whether the plurality of audio frames, the plurality of video frames, and the plurality of audio and video frames are relevant to a user's profile. The processor ranks the subset of the plurality of audio frames, the subset of the plurality of video frames, and the subset of the plurality of audio and video frames using an algorithmic method or a second set of factors. The processor merges the subset of the plurality of audio frames, the subset of the plurality of video frames, and the subset of the plurality of audio and video frames ranked above an algorithmically determined threshold. The processor prepares a unified summary of a subset of the plurality of audio frames ranked above an algorithmically determined threshold, a unified summary of a subset of the plurality of video frames ranked above an algorithmically determined threshold, and a unified summary of a subset of the plurality of audio and video frames ranked above an algorithmically determined threshold.

本発明の実施形態の一部の態様では、プロセッサは、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声フレームのサブセットの統合された要約、アルゴリズムで決定されたしきい値より上にランク付けされた複数のビデオ・フレームのサブセットの統合された要約、ならびにアルゴリズムで決定されたしきい値より上にランク付けされた複数の音声およびビデオ・フレームのサブセットの統合された要約を準備した後に、抽出テキスト要約アルゴリズムを使用して統合された要約を１つまたは複数の文に変換する。プロセッサは、話者ダイアライゼーション(diarisation)を適用する。プロセッサは、要因の第３のセットに基づいて１つまたは複数の文をランク付けする。プロセッサは、第２の事前設定されたユーザの嗜好または第２の機械駆動の推奨に基づく第２のしきい値より上にランク付けされた１つまたは複数の文のサブセットを保持する。 In some aspects of embodiments of the present invention, the processor prepares an integrated summary of a subset of the plurality of audio frames ranked above an algorithmically determined threshold, an integrated summary of a subset of the plurality of video frames ranked above an algorithmically determined threshold, and an integrated summary of a subset of the plurality of audio and video frames ranked above an algorithmically determined threshold, and then converts the integrated summary into one or more sentences using an extractive text summarization algorithm. The processor applies speaker diarization. The processor ranks the one or more sentences based on a third set of factors. The processor retains a subset of the one or more sentences ranked above a second threshold based on a second preset user preference or a second machine-driven recommendation.

本発明の実施形態の一部の態様では、プロセッサは、第２の事前設定されたユーザの嗜好または第２の機械駆動の推奨に基づく第２のしきい値より上にランク付けされた１つまたは複数の文のサブセットを保持した後に、１つまたは複数の文のサブセット内の１つまたは複数のキーワードを識別する。プロセッサは、仮想会議の１つまたは複数のトピックとの１つまたは複数のキーワードの関連性に基づいて、および仮想会議の少なくとも２人の参加者との１つまたは複数のキーワードの関連性に基づいて、重み付けを１つまたは複数の文のサブセット内の１つまたは複数のキーワードに割り当てる。プロセッサは、１つまたは複数のキーワードを使用して１つまたは複数の文のサブセットをタグ付けする。 In some aspects of embodiments of the present invention, the processor identifies one or more keywords within the subset of one or more sentences after retaining the subset of one or more sentences ranked above a second threshold based on a second preset user preference or a second machine-driven recommendation. The processor assigns weights to one or more keywords within the subset of one or more sentences based on the relevance of the one or more keywords to one or more topics of the virtual conference and based on the relevance of the one or more keywords to at least two participants of the virtual conference. The processor tags the subset of one or more sentences with the one or more keywords.

本発明の実施形態の一部の態様では、プロセッサは、１つまたは複数のキーワードを使用して１つまたは複数の文のサブセットをタグ付けした後に、少なくとも２人の参加者、１つまたは複数のキーワード、および１つまたは複数の文のサブセットを組み込むグローバルなランク付けを準備する。プロセッサは、グローバルなランク付けに基づいて、ユーザのプロフィールに合わせて調整された要約を準備する。 In some aspects of embodiments of the present invention, the processor tags the one or more subsets of sentences with one or more keywords and then prepares a global ranking incorporating at least two participants, the one or more keywords, and the one or more subsets of sentences. The processor prepares a summary tailored to the user's profile based on the global ranking.

本発明の実施形態の一部の態様では、データベースから収集された、ユーザに関するデータの第１のセット、仮想会議の少なくとも２人の参加者に関するデータの第２のセット、ユーザと仮想会議の少なくとも２人の参加者の間の関係に関するデータの第３のセットは、ユーザによって作成されたプロフィール、ユーザの企業プロフィール、ユーザのカレンダー、仮想会議の少なくとも２人の参加者によって作成された１つまたは複数のプロフィール、仮想会議の少なくとも２人の参加者の企業プロフィール、仮想会議の少なくとも２人の参加者のカレンダー、仮想会議の少なくとも２人の参加者によって使用される１つまたは複数のプレゼンテーション・ツール、およびユーザがホストしたか、参加したか、または出席した１つまたは複数の以前の仮想会議からのデータを含む。 In some aspects of embodiments of the present invention, the first set of data regarding the user, the second set of data regarding at least two participants in the virtual conference, and the third set of data regarding relationships between the user and the at least two participants in the virtual conference collected from the database include a profile created by the user, a company profile for the user, the user's calendar, one or more profiles created by the at least two participants in the virtual conference, company profiles for the at least two participants in the virtual conference, calendars for the at least two participants in the virtual conference, one or more presentation tools used by the at least two participants in the virtual conference, and data from one or more previous virtual conferences hosted, participated in, or attended by the user.

本発明の実施形態の一部の態様では、プロセッサは、ユーザが再接続したときの仮想会議中の時間を捕捉する。プロセッサは、ユーザが仮想会議から切り離されていた時間の全持続時間を計算する。 In some aspects of embodiments of the present invention, the processor captures the time during the virtual conference when the user reconnects. The processor calculates the total duration of time the user was disconnected from the virtual conference.

本発明の実施形態の一部の態様では、要約をレビューするためのデフォルトのユーザの嗜好のセットは、言語の嗜好、視聴モードの嗜好、音声の嗜好、およびサブタイトルの嗜好を含む。 In some aspects of embodiments of the present invention, the set of default user preferences for reviewing summaries includes language preferences, viewing mode preferences, audio preferences, and subtitle preferences.

本発明の実施形態の一部の態様では、プロセッサは、１つまたは複数の主要な実体(key entities)の第１のセットを要約から抽出し、１つまたは複数の主要な実体の第２のセットを仮想会議の完全な記録から抽出する。プロセッサは、要約からの１つまたは複数の主要な実体の第１のセットを、仮想会議の完全な記録からの１つまたは複数の主要な実体の第２のセットと照合する。 In some aspects of embodiments of the present invention, a processor extracts a first set of one or more key entities from the summary and a second set of one or more key entities from the full recording of the virtual conference. The processor matches the first set of one or more key entities from the summary with the second set of one or more key entities from the full recording of the virtual conference.

本発明の実施形態の一部の態様では、プロセッサは、ユーザの満足度を表す３つ以上の選択をユーザに提供し、３つ以上の選択は、不満足、中立、および満足を含む。 In some aspects of embodiments of the present invention, the processor provides the user with three or more choices representing the user's satisfaction level, the three or more choices including dissatisfied, neutral, and satisfied.

本発明の実施形態の一部の態様では、要因の第１のセットは、仮想会議における少なくとも２人の参加者の役割、企業における少なくとも２人の参加者の役割、および仮想会議から離れたユーザとの少なくとも２人の参加者の関連性を含む。 In some aspects of embodiments of the present invention, the first set of factors includes the roles of the at least two participants in the virtual conference, the roles of the at least two participants in the enterprise, and the association of the at least two participants with a user remote from the virtual conference.

本発明の実施形態の一部の態様では、アルゴリズムで決定されたしきい値は、動的であり、トレーニング・データ・セットに合わせて設定される。 In some aspects of embodiments of the present invention, the algorithmically determined threshold is dynamic and tailored to the training data set.

本発明の実施形態の一部の態様では、要因の第２のセットは、フレームが、仮想会議から切り離されたユーザの名前を含んでいるかどうか、少なくとも２人の参加者のランク付け、ユーザの技術に関する関心に基づくか、ユーザの専門知識に基づくか、または過去の分析に基づく、仮想会議から離れたユーザとの少なくとも２人の参加者の関連性を含む。 In some aspects of embodiments of the present invention, the second set of factors includes whether the frame includes the name of a user who has disconnected from the virtual conference, a ranking of at least two participants, and the association of at least two participants with the user who has disconnected from the virtual conference based on the user's technology interests, the user's expertise, or past analysis.

本発明の実施形態の一部の態様では、プロセッサは、複数の音声フレーム内の少なくとも２人の参加者間の変更点を検出する。プロセッサは、少なくとも２人の参加者の特性に基づいて、音声のセグメントを一緒にグループ化する。 In some aspects of embodiments of the present invention, a processor detects changes between at least two participants in a plurality of frames of audio. The processor groups segments of audio together based on characteristics of the at least two participants.

本発明の実施形態の一部の態様では、要因の第３のセットは、仮想会議の少なくとも２人の参加者の役割および仮想会議の１つまたは複数のトピックとの１つまたは複数の文の関連性を含む。 In some aspects of embodiments of the present invention, the third set of factors includes the roles of at least two participants in the virtual meeting and the relevance of the one or more statements to one or more topics of the virtual meeting.

本発明の実施形態に従って、分散データ処理環境を示す機能ブロック図である。1 is a functional block diagram illustrating a distributed data processing environment in accordance with an embodiment of the present invention. 本発明の実施形態に従って、図１に示された分散データ処理環境などの分散データ処理環境内のユーザ固有の要約プログラムの設定コンポーネントの動作ステップを示すフローチャートである。2 is a flowchart illustrating the operational steps of a configuration component of a user-specific abstraction program in a distributed data processing environment, such as the distributed data processing environment illustrated in FIG. 1, in accordance with an embodiment of the present invention. 本発明の実施形態に従って、図１に示された分散データ処理環境などの分散データ処理環境内のユーザ固有の要約プログラムの動作ステップを示すフローチャートである。2 is a flowchart illustrating the operational steps of a user-specific summary program in a distributed data processing environment, such as the distributed data processing environment shown in FIG. 1, in accordance with an embodiment of the present invention. 本発明の実施形態に従って、図１に示された分散データ処理環境などの分散データ処理環境内のユーザ固有の要約プログラムの機械学習コンポーネントの動作ステップを示すフローチャートである。2 is a flowchart illustrating the operational steps of a machine learning component of a user-specific summarization program in a distributed data processing environment, such as the distributed data processing environment shown in FIG. 1, according to an embodiment of the present invention. 本発明の実施形態に従う、図１に示された分散データ処理環境などの分散データ処理環境内のコンピューティング・デバイスのコンポーネントのブロック図である。2 is a block diagram of components of a computing device in a distributed data processing environment, such as the distributed data processing environment shown in FIG. 1, according to an embodiment of the present invention.

本発明の実施形態は、仮想コラボレーション・サーバ・ツール（virtual collaboration server tool）上の会議中に問題が発生する可能性があるということを認識している。例えば、技術的問題が、ユーザが会議から切断されることを引き起こすことがある。この切断に起因して、ユーザは、問題またはトピックに関するホストまたは別の参加者の情報提供、別の参加者の質問または懸念に対するホストの応答、参加者によって取られた行動の経過、あるいは会議から切断されたユーザに向けられている期待される重要点および要処置事項などの、重要な情報を聞きもらすことがある。 Embodiments of the present invention recognize that problems may occur during a conference on a virtual collaboration server tool. For example, technical issues may cause a user to be disconnected from the conference. Due to this disconnection, the user may miss important information, such as a host's or another participant's information about an issue or topic, a host's response to another participant's question or concern, a progress report of actions taken by a participant, or expected takeaways and action items directed to the user who was disconnected from the conference.

本発明の実施形態は、仮想会議ユーザ・エクスペリエンスを改善するための拡張インテリジェンスに基づくシステムおよび方法を提供する。本発明の実施形態は、仮想コラボレーション・サーバ・ツール上の会議から離れているユーザのプロフィールに合わせて調整された、ユーザが切り離されていた間の会議の一部を対象にする、要約を準備することを提案する。 Embodiments of the present invention provide an augmented intelligence-based system and method for improving the virtual conference user experience. Embodiments of the present invention propose preparing a summary tailored to the profile of a user who is away from a conference on a virtual collaboration server tool, covering the portion of the conference during which the user was disconnected.

本発明の実施形態は、ユーザが明示的にまたは暗黙的に会議から離れたときを検出する。ユーザの離脱を検出した後に、本発明の実施形態は、（１）マージおよび（２）抽出テキスト要約アルゴリズムを使用する音声テキスト要約（speech-to-text summarization）の概念を、ユーザが切り離されていた間の会議の一部から抽出された音声フレームまたはビデオ・フレームあるいはその両方に適用する。 Embodiments of the present invention detect when a user leaves a conference, either explicitly or implicitly. After detecting the user's departure, embodiments of the present invention apply the concepts of speech-to-text summarization using (1) merging and (2) extracted text summarization algorithms to the audio and/or video frames extracted from the portion of the conference during which the user was disconnected.

本発明の実施形態は、話者ダイアライゼーションを使用して、参加者の識別情報に従って文を同質のセグメントに分割し、参加者の文からキーワードを識別する。本発明の実施形態は、重み付けを参加者およびキーワードに割り当てる。本発明の実施形態は、識別されて重み付けされた参加者、識別されて重み付けされたキーワード、ならびにランク付けされた文を組み込むグローバルなランク付けを準備する。 Embodiments of the present invention use speaker diarization to divide sentences into homogeneous segments according to participant identification information and identify keywords from the participants' sentences. Embodiments of the present invention assign weights to participants and keywords. Embodiments of the present invention prepare a global ranking that incorporates the identified and weighted participants, the identified and weighted keywords, and the ranked sentences.

本発明の実施形態は、３０秒のビデオまたは１ページの長さのテキストのどちらかの形態で、文を短い要約に減らす。本発明の実施形態は、各ユーザの企業プロフィール、ユーザの企業の階層、企業の階層内のユーザの地位、ユーザの企業内のユーザの年功序列、ユーザの主要な仕事、ユーザの主要な仕事の職務、ユーザの技術に関する関心、ユーザの専門知識、およびユーザがホストしたか、参加したか、または出席した会議への関与のユーザの履歴などの要因を使用して、仮想コラボレーション・サーバ・ツール上で会議から離れているユーザのプロフィールに合わせて要約を調整する。本発明の実施形態は、要約プロセスの一部として、計算された音声認識信頼度スコアおよび意図信頼度スコアも考慮する。 Embodiments of the present invention reduce sentences to short summaries in the form of either a 30-second video or a page of text. Embodiments of the present invention tailor summaries to the profile of users away from the conference on the virtual collaboration server tool using factors such as each user's company profile, the user's company hierarchy, the user's position within the company hierarchy, the user's seniority within the user's company, the user's primary job, the user's primary job function, the user's technology interests, the user's expertise, and the user's history of involvement in conferences the user has hosted, participated in, or attended. Embodiments of the present invention also consider calculated speech recognition confidence scores and intent confidence scores as part of the summarization process.

本発明の実施形態は、他の参加者とのユーザの関係の重みを活用して、ユーザが会議に再接続した後に、ユーザにいつ要約の使用を促すかを決定する。本発明の実施形態は、会議中または会議後に要約を再生するための選択肢をユーザに提供する。本発明の実施形態は、音声、ビデオ、字幕、またはこれら３つの形式の任意の組合せを含む要約を再生するための選択肢もユーザに提供する。 Embodiments of the present invention leverage the weight of a user's relationships with other participants to determine when to prompt the user to use the summary after the user reconnects to the conference. Embodiments of the present invention provide the user with the option to play the summary during or after the conference. Embodiments of the present invention also provide the user with the option to play a summary that includes audio, video, subtitles, or any combination of these three formats.

本発明の実施形態は、要約の未来の準備を改善するために、ユーザからフィードバックを収集する。 Embodiments of the present invention collect feedback from users to improve future preparation of summaries.

本発明の実施形態の実装は、さまざまな形態をとってよく、以下では、各図を参照して例示的な実装の詳細が説明される。 Implementations of embodiments of the present invention may take a variety of forms, and exemplary implementation details are described below with reference to the figures.

図１は、本発明の１つの実施形態に従って、分散データ処理環境（概して１００と指定される）を示す機能ブロック図である。示された実施形態では、分散データ処理環境１００は、ネットワーク１１０を経由して相互接続されたサーバ１２０およびユーザ・コンピューティング・デバイス１３０を含んでいる。分散データ処理環境１００は、図に示されていない追加のサーバ、コンピュータ、コンピューティング・デバイス、ＩｏＴセンサ、および他のデバイスを含んでよい。図１は、単に本発明の１つの実施形態の例を提供しており、さまざまな実施形態が実装され得る環境に関して、どのような制限も意味していない。特許請求の範囲に列挙されている本発明の範囲から逸脱することなく、当業者によって、示された環境に対する多くの変更が行われてよい。 FIG. 1 is a functional block diagram illustrating a distributed data processing environment (generally designated 100) in accordance with one embodiment of the present invention. In the illustrated embodiment, distributed data processing environment 100 includes servers 120 and user computing devices 130 interconnected via network 110. Distributed data processing environment 100 may include additional servers, computers, computing devices, IoT sensors, and other devices not shown. FIG. 1 provides merely an example of one embodiment of the present invention and is not intended to imply any limitations with regard to the environments in which various embodiments may be implemented. Many modifications to the illustrated environment may be made by those skilled in the art without departing from the scope of the present invention, as set forth in the claims.

ネットワーク１１０は、例えば、電気通信ネットワーク、ローカル・エリア・ネットワーク、インターネットなどの広域ネットワーク、またはこれらの３つの組合せであることができ、有線接続、ワイヤレス接続、または光ファイバ接続を含むことができる、コンピューティング・ネットワークとして動作する。ネットワーク１１０は、データ情報、音声情報、およびビデオ情報を含んでいるマルチメディア信号を含むデータ信号、音声信号、またはビデオ信号、あるいはその組合せを受信および送信できる１つまたは複数の有線ネットワークまたはワイヤレス・ネットワークあるいはその両方を含むことができる。一般に、ネットワーク１１０は、分散データ処理環境１００内のサーバ１２０、ユーザ・コンピューティング・デバイス１３０、および他のコンピューティング・デバイス（図示されていない）の間の通信をサポートする接続およびプロトコルの任意の組合せであることができる。 Network 110 operates as a computing network, which may be, for example, a telecommunications network, a local area network, a wide area network such as the Internet, or a combination of the three, and may include wired, wireless, or fiber optic connections. Network 110 may include one or more wired and/or wireless networks capable of receiving and transmitting data, audio, and/or video signals, including multimedia signals containing data, audio, and video information. In general, network 110 may be any combination of connections and protocols supporting communication between server 120, user computing devices 130, and other computing devices (not shown) in distributed data processing environment 100.

サーバ１２０は、ユーザ固有の要約プログラム１２２を実行し、データをデータベース１２６に送信するか、または格納するか、あるいはその両方を実行するように動作する。実施形態では、サーバ１２０は、データをデータベース１２６からユーザ・コンピューティング・デバイス１３０に送信することができる。実施形態では、サーバ１２０は、データベース１２６内のデータをユーザ・コンピューティング・デバイス１３０から受信することができる。１つまたは複数の実施形態では、サーバ１２０は、スタンドアロン・コンピューティング・デバイス、管理サーバ、Ｗｅｂサーバ、モバイル・コンピューティング・デバイス、またはデータを受信、送信、および処理することができる任意の他の電子デバイスもしくはコンピューティング・システムであることができる。１つまたは複数の実施形態では、サーバ１２０は、クラウド・コンピューティング環境内などの分散データ処理環境１００内でアクセスされたときにシームレスなリソースの単一のプールとして機能するクラスタ化されたコンピュータおよびコンポーネント（例えば、データベース・サーバ・コンピュータ、アプリケーション・サーバ・コンピュータなど）を利用する、コンピューティング・システムであることができる。１つまたは複数の実施形態では、サーバ１２０は、ラップトップ・コンピュータ、タブレット・コンピュータ、ネットブック・コンピュータ、パーソナル・コンピュータ、デスクトップ・コンピュータ、パーソナル・デジタル・アシスタント、スマートフォン、またはネットワーク１１０を介して分散データ処理環境１００内のユーザ・コンピューティング・デバイス１３０および他のコンピューティング・デバイス（図示されていない）と通信できる任意のプログラム可能な電子デバイスであることができる。サーバ１２０は、図５さらに詳細に示され、説明されるように、内部および外部のハードウェア・コンポーネントを含んでよい。 The server 120 operates to execute a user-specific summarization program 122 and transmit and/or store data in a database 126. In an embodiment, the server 120 may transmit data from the database 126 to a user computing device 130. In an embodiment, the server 120 may receive data in the database 126 from a user computing device 130. In one or more embodiments, the server 120 may be a standalone computing device, an administrative server, a web server, a mobile computing device, or any other electronic device or computing system capable of receiving, transmitting, and processing data. In one or more embodiments, the server 120 may be a computing system utilizing clustered computers and components (e.g., database server computers, application server computers, etc.) that function as a single pool of seamless resources when accessed within a distributed data processing environment 100, such as within a cloud computing environment. In one or more embodiments, server 120 may be a laptop computer, tablet computer, netbook computer, personal computer, desktop computer, personal digital assistant, smartphone, or any programmable electronic device capable of communicating with user computing devices 130 and other computing devices (not shown) in distributed data processing environment 100 via network 110. Server 120 may include internal and external hardware components, as shown and described in more detail in FIG. 5.

ユーザ固有の要約プログラム１２２は、仮想コラボレーション・サーバ・ツール上の会議から離れているユーザのプロフィールに合わせて調整された、ユーザが切り離されていた間の会議の一部を対象にする、要約を準備するように動作する。 The user-specific summary program 122 operates to prepare a summary tailored to the profile of a user who is away from a conference on the virtual collaboration server tool, covering the portion of the conference while the user was disconnected.

示された実施形態では、ユーザ固有の要約プログラム１２２は、機械学習コンポーネント１２４を含む。示された実施形態では、ユーザ固有の要約プログラム１２２は、スタンドアロン・プログラムである。別の実施形態では、ユーザ固有の要約プログラム１２２は、仮想会議ソフトウェア・パッケージなどの別のソフトウェア製品に統合されてよい。示された実施形態では、ユーザ固有の要約プログラム１２２はサーバ１２０に存在する。他の実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ固有の要約プログラム１２２がネットワーク１１０にアクセスできるということを条件として、ユーザ・コンピューティング・デバイス１３０または別のコンピューティング・デバイス（図示されていない）に存在してよい。 In the illustrated embodiment, the user-specific summarization program 122 includes a machine learning component 124. In the illustrated embodiment, the user-specific summarization program 122 is a standalone program. In another embodiment, the user-specific summarization program 122 may be integrated into another software product, such as a virtual conferencing software package. In the illustrated embodiment, the user-specific summarization program 122 resides on the server 120. In other embodiments, the user-specific summarization program 122 may reside on the user computing device 130 or another computing device (not shown), provided that the user-specific summarization program 122 has access to the network 110.

実施形態では、ユーザは、ユーザ固有の要約プログラム１２２にオプトインし、ユーザ固有の要約プログラム１２２を使用してユーザ・プロフィールを設定する。ユーザ固有の要約プログラム１２２の設定コンポーネントが、図２に関してさらに詳細に示され、説明される。ユーザ固有の要約プログラム１２２の動作ステップが、図３に関してさらに詳細に示され、説明される。ユーザ固有の要約プログラム１２２の機械学習コンポーネント１２４の動作ステップが、図４に関してさらに詳細に示され、説明される。 In an embodiment, a user opts in to the user-specific summarization program 122 and configures a user profile using the user-specific summarization program 122. The configuration components of the user-specific summarization program 122 are shown and described in further detail with respect to FIG. 2. The operational steps of the user-specific summarization program 122 are shown and described in further detail with respect to FIG. 3. The operational steps of the machine learning component 124 of the user-specific summarization program 122 are shown and described in further detail with respect to FIG. 4.

データベース１２６は、ユーザ固有の要約プログラム１２２によって受信されたデータ、使用されたデータ、または生成されたデータ、あるいはその組合せのリポジトリとして動作する。データベースは、データの構造化された集合である。データは、各ユーザの企業プロフィール、ユーザの企業の階層、企業の階層内のユーザの地位、ユーザの企業内のユーザの年功序列、ユーザの主要な仕事、ユーザの主要な仕事の職務、ユーザの技術に関する関心、およびユーザの専門知識に関する設定中にユーザによって入力された情報を含む複数のユーザ・プロフィール、仮想コラボレーション・サーバ・ツール上の現在の会議からのデータ、またはユーザがホストしたか、参加したか、または出席した仮想コラボレーション・サーバ・ツール上の以前の会議からのデータ、あるいはその両方（すなわち、音声フレーム、ビデオ・フレーム、または音声およびビデオ・フレーム）、ユーザの嗜好、警告通知の嗜好、ならびにユーザ固有の要約プログラム１２２によって受信されたか、使用されたか、または生成されたか、あるいはその組合せである任意の他のデータを含むが、これらに限定されない。 The database 126 acts as a repository of data received, used, and/or generated by the user-specific summarization program 122. The database is a structured collection of data. The data includes, but is not limited to, a plurality of user profiles including information entered by the user during setup regarding each user's company profile, the user's company hierarchy, the user's position within the company hierarchy, the user's seniority within the user's company, the user's primary job, the user's primary job function, the user's technology interests, and the user's expertise; data from a current meeting on the virtual collaboration server tool and/or data from previous meetings on the virtual collaboration server tool that the user hosted, participated in, or attended; user preferences; alert notification preferences; and any other data received, used, and/or generated by the user-specific summarization program 122.

データベース１２６は、ハード・ディスク・ドライブ、データベース・サーバ、またはフラッシュ・メモリなどの、サーバ１２０によってアクセスされて利用され得るデータおよび構成ファイルを格納できる任意の種類のデバイスを使用して実装され得る。実施形態では、データベース１２６は、データを格納するか、またはデータにアクセスするか、あるいはその両方のために、ユーザ固有の要約プログラム１２２によってアクセスされる。示された実施形態では、データベース１２６はサーバ１２０に存在する。別の実施形態では、データベース１２６は、ユーザ固有の要約プログラム１２２がデータベース１２６にアクセスできるということを条件として、別のコンピューティング・デバイス、サーバ、クラウド・サーバに存在するか、または分散データ処理環境１００内のどこかの複数のデバイス（図示されていない）にわたって分散されてよい。 Database 126 may be implemented using any type of device capable of storing data and configuration files that can be accessed and utilized by server 120, such as a hard disk drive, database server, or flash memory. In an embodiment, database 126 is accessed by user-specific summary program 122 to store data, access data, or both. In the illustrated embodiment, database 126 resides on server 120. In another embodiment, database 126 may reside on another computing device, server, cloud server, or be distributed across multiple devices (not shown) anywhere in distributed data processing environment 100, provided that user-specific summary program 122 has access to database 126.

本発明は、ユーザが処理されないことを望む個人または企業の機密あるいはその両方のデータ、コンテンツ、または情報を含み得る、データベース１２６などの、さまざまなアクセス可能なデータ・ソースを含んでよい。処理とは、任意の自動化されたか、または自動化されない動作、あるいは一連の動作のことを指し、そのような動作としては、個人データまたは企業機密データあるいはその両方の収集、記録、編成、構造化、格納、適応、変更、検索、参照、使用、送信、配布、または他の方法で使用可能にすることによる開示、結合、制限、消去、あるいは破壊などがある。ユーザ固有の要約プログラム１２２は、個人データの許可された安全な処理を可能にする。データベース１２６内のデータのすべての格納は、現地法および取得された適切な許可に従って実行されなければならない。 The present invention may involve a variety of accessible data sources, such as database 126, which may contain personal and/or business-sensitive data, content, or information that a user desires not to be processed. Processing refers to any automated or non-automated action or set of actions, such as collecting, recording, organizing, structuring, storing, adapting, altering, retrieving, consulting, using, disclosing by transmitting, disseminating, or otherwise making available, combining, restricting, erasing, or destroying personal and/or business-sensitive data. The user-specific abstraction program 122 enables authorized and secure processing of personal data. All storage of data in database 126 must be performed in accordance with local law and any appropriate authorizations obtained.

ユーザ固有の要約プログラム１２２は、個人データまたは企業機密データあるいはその両方の収集の通知と共に、情報に基づく同意(informed consent)を提供し、ユーザが、個人データまたは企業機密データあるいはその両方を処理することをオプトインまたはオプトアウトできるようにする。同意は、複数の形態をとることができる。オプトインの同意は、個人データまたは企業機密データあるいはその両方が処理される前に、積極的行動(affirmative action)を取ることをユーザに強制することができる。代替として、オプトアウトの同意は、個人データまたは企業機密データあるいはその両方が処理される前に、個人データまたは企業機密データあるいはその両方の処理を防ぐための積極的行動を取ることをユーザに強制することができる。ユーザ固有の要約プログラム１２２は、個人データまたは企業機密データあるいはその両方および処理の性質（例えば、種類、範囲、目的、持続時間など）に関する情報を提供する。ユーザ固有の要約プログラム１２２は、格納された個人データまたは企業機密データあるいはその両方のコピーをユーザに提供する。ユーザ固有の要約プログラム１２２は、正しくないか、または不完全な個人データまたは企業機密データあるいはその両方の修正または完成を可能にする。ユーザ固有の要約プログラム１２２は、個人データまたは企業機密データあるいはその両方の即時の削除を可能にする。 The user-specific summary program 122 provides informed consent along with notice of the collection of personal data and/or confidential business data, allowing the user to opt in or out of processing the personal data and/or confidential business data. Consent can take multiple forms. Opt-in consent can force the user to take affirmative action before the personal data and/or confidential business data are processed. Alternatively, opt-out consent can force the user to take affirmative action to prevent the processing of the personal data and/or confidential business data before the personal data and/or confidential business data are processed. The user-specific summary program 122 provides information about the personal data and/or confidential business data and the nature of the processing (e.g., type, scope, purpose, duration, etc.). The user-specific summary program 122 provides the user with a copy of the stored personal data and/or confidential business data. The user-specific summarization program 122 allows for the correction or completion of incorrect or incomplete personal data and/or confidential business data. The user-specific summarization program 122 allows for the immediate deletion of personal data and/or confidential business data.

ユーザ・コンピューティング・デバイス１３０は、ユーザ・インターフェイス１３２を実行するように動作し、ユーザに関連付けられる。実施形態では、ユーザ・コンピューティング・デバイス１３０は、ラップトップ・コンピュータ、タブレット・コンピュータ、ネットブック・コンピュータ、パーソナル・コンピュータ、デスクトップ・コンピュータ、スマートフォン、またはユーザ・インターフェイス１３２を実行し、ネットワーク１１０を介してユーザ固有の要約プログラム１２２と通信する（すなわち、データを送信し、データを受信する）ことができる任意のプログラム可能な電子デバイスなどの、電子デバイスであってよい。示された実施形態では、ユーザ・コンピューティング・デバイス１３０は、ユーザ・インターフェイス１３２のインスタンスを含む。図５でさらに詳細に説明されるように、ユーザ・コンピューティング・デバイス１３０はコンポーネントを含んでよい。 The user computing device 130 operates to execute the user interface 132 and is associated with a user. In an embodiment, the user computing device 130 may be an electronic device such as a laptop computer, a tablet computer, a netbook computer, a personal computer, a desktop computer, a smartphone, or any programmable electronic device capable of executing the user interface 132 and communicating (i.e., sending data to and receiving data from) the user-specific summary program 122 over the network 110. In the illustrated embodiment, the user computing device 130 includes an instance of the user interface 132. The user computing device 130 may include components as described in further detail in FIG. 5.

ユーザ・インターフェイス１３２は、サーバ１２０上のユーザ固有の要約プログラム１２２とユーザ・コンピューティング・デバイス１３０のユーザの間のローカル・ユーザ・インターフェイスとして動作する。一部の実施形態では、ユーザ・インターフェイス１３２は、グラフィカル・ユーザ・インターフェイス（ＧＵＩ：graphical user interface）、Ｗｅｂユーザ・インターフェイス（ＷＵＩ：web user interface）、または音声ユーザ・インターフェイス（ＶＵＩ：voice user interface）、あるいはその組合せであり、ネットワーク１１０を介してユーザ固有の要約プログラム１２２からユーザに送信されたテキスト、文書、Ｗｅｂブラウザのウィンドウ、ユーザの選択肢、アプリケーション・インターフェイス、および動作のための命令を、表示する（すなわち、視覚的に表示する）か、または提示する（すなわち、聞こえるように提示する）ことができる。ユーザ・インターフェイス１３２は、ネットワーク１１０を介してユーザ固有の要約プログラム１２２からユーザに送信された情報（例えば、グラフィックス、テキスト、または音、あるいはその組合せ）を含んでいる警告通知を表示または提示することもできる。実施形態では、ユーザ・インターフェイス１３２は、データを送信および受信する（すなわち、ネットワーク１１０を介してユーザ固有の要約プログラム１２２との間でそれぞれ送信および受信する）ことができる。 The user interface 132 acts as a local user interface between the user-specific summary program 122 on the server 120 and the user of the user computing device 130. In some embodiments, the user interface 132 is a graphical user interface (GUI), a web user interface (WUI), or a voice user interface (VUI), or a combination thereof, and can display (i.e., visually display) or present (i.e., audibly present) text, documents, web browser windows, user options, application interfaces, and instructions for operation sent to the user from the user-specific summary program 122 over the network 110. The user interface 132 can also display or present alert notifications containing information (e.g., graphics, text, and/or sound) sent to the user from the user-specific summary program 122 over the network 110. In an embodiment, the user interface 132 is capable of transmitting and receiving data (i.e., to and from, respectively, the user-specific summary program 122 over the network 110).

ユーザ・インターフェイス１３２を介して、ユーザはユーザ固有の要約プログラム１２２にオプトインし、ユーザ・プロフィールを作成し、各ユーザの企業プロフィール、ユーザの企業の階層、企業の階層内のユーザの地位、ユーザの企業内のユーザの年功序列、ユーザの主要な仕事、ユーザの主要な仕事の職務、ユーザの技術に関する関心、およびユーザの専門知識に関する情報を入力し、ユーザの嗜好および警告通知の嗜好を設定することができる。 Through the user interface 132, users can opt in to the user-specific summary program 122, create a user profile, enter information about each user's company profile, the user's company hierarchy, the user's position within the company hierarchy, the user's seniority within the user's company, the user's primary job, the user's primary job function, the user's technology interests, and the user's expertise, and set user preferences and alert notification preferences.

ユーザの嗜好は、特定のユーザ用にカスタマイズされ得る設定である。デフォルトのユーザの嗜好のセットは、ユーザ固有の要約プログラム１２２の各ユーザに割り当てられる。デフォルトのユーザの嗜好を変更するように値を更新するために、ユーザ嗜好エディタ（user preference editor）が使用され得る。カスタマイズされ得るユーザの嗜好は、一般的なユーザ・システム設定、ユーザ固有の要約プログラム１２２に関する特定のユーザ・プロフィール設定、警告通知設定、および機械学習されるデータの収集／格納設定を含むが、これらに限定されない。機械学習されるデータは、ユーザ固有の要約プログラム１２２の反復の過去の結果、およびユーザ固有の要約プログラム１２２によって送信された警告通知に対するユーザの以前の応答に関するデータを含むが、これらに限定されない。機械学習されるデータは、仮想コラボレーション・サーバ・ツール上の会議から離れているユーザのプロフィールに合わせて調整された、ユーザが離れていた間の会議の一部を対象にする、要約を準備する方法、および会議に再び参加する前に要約をレビューするようユーザに促すかどうかを自己学習する、ユーザ固有の要約プログラム１２２から来る。ユーザ固有の要約プログラム１２２は、ユーザの活動を追跡することによって自己学習し、ユーザ固有の要約プログラム１２２の各反復と共に改善する。 User preferences are settings that can be customized for a particular user. A set of default user preferences is assigned to each user of the user-specific summarization program 122. A user preference editor can be used to update values to change the default user preferences. User preferences that can be customized include, but are not limited to, general user system settings, specific user profile settings for the user-specific summarization program 122, alert notification settings, and machine-learned data collection/storage settings. Machine-learned data includes, but is not limited to, past results of iterations of the user-specific summarization program 122 and data regarding the user's previous responses to alert notifications sent by the user-specific summarization program 122. The machine-learned data comes from the user-specific summarization program 122, which is tailored to the user's profile when away from a meeting on the virtual collaboration server tool, self-learns how to prepare a summary covering the portion of the meeting while the user was away, and whether to prompt the user to review the summary before rejoining the meeting. The user-specific summarization program 122 self-learns by tracking the user's activity and improves with each iteration of the user-specific summarization program 122.

図２は、本発明の実施形態に従って、図１の分散データ処理環境１００内のユーザ固有の要約プログラム１２２の設定コンポーネントの動作ステップを示す、概して２００と指定されたフローチャートである。実施形態では、ユーザ固有の要約プログラム１２２は、仮想コラボレーション・サーバ・ツール上の会議をホストするか、会議に参加するか、または会議に出席するユーザとの１回限りの設定を完了する。１回限りの設定は、ユーザ固有の要約プログラム１２２がユーザに関する関連情報を捕捉してユーザ・プロフィールを作成することを可能にする。実施形態では、ユーザ固有の要約プログラム１２２は、オプトインするための要求をユーザから受信する。実施形態では、ユーザ固有の要約プログラム１２２はユーザに対して情報を要求する。実施形態では、ユーザ固有の要約プログラム１２２は、要求された情報をユーザから受信する。実施形態では、ユーザ固有の要約プログラム１２２はユーザ・プロフィールを作成する。実施形態では、ユーザ固有の要約プログラム１２２はユーザ・プロフィールを格納する。図２に示されたプロセスがユーザ固有の要約プログラム１２２の１つの可能な反復を示しているということが理解されるべきであり、このプロセスは、ユーザ固有の要約プログラム１２２によって受信されたオプトイン要求ごとに繰り返されてよい。 2 is a flowchart generally designated 200 illustrating operational steps of a setup component of a user-specific summary program 122 in the distributed data processing environment 100 of FIG. 1 in accordance with an embodiment of the present invention. In an embodiment, the user-specific summary program 122 completes a one-time setup with a user who will host, participate in, or attend a conference on the virtual collaboration server tool. The one-time setup enables the user-specific summary program 122 to capture relevant information about the user to create a user profile. In an embodiment, the user-specific summary program 122 receives a request to opt in from the user. In an embodiment, the user-specific summary program 122 requests information from the user. In an embodiment, the user-specific summary program 122 receives the requested information from the user. In an embodiment, the user-specific summary program 122 creates a user profile. In an embodiment, the user-specific summary program 122 stores the user profile. It should be understood that the process illustrated in FIG. 2 represents one possible iteration of the user-specific summary program 122, and that this process may be repeated for each opt-in request received by the user-specific summary program 122.

ステップ２１０で、ユーザ固有の要約プログラム１２２は、オプトインするための要求をユーザから受信する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ固有の要約プログラム１２２にオプトインするための要求をユーザから受信する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介して、ユーザ固有の要約プログラム１２２にオプトインするための要求をユーザから受信する。ユーザは、オプトインすることによって、データベース１２６とデータを共有することに同意する。 At step 210, the user-specific summary program 122 receives a request to opt in from the user. In an embodiment, the user-specific summary program 122 receives a request from the user to opt in to the user-specific summary program 122. In an embodiment, the user-specific summary program 122 receives a request from the user to opt in to the user-specific summary program 122 via the user interface 132 of the user computing device 130. By opting in, the user agrees to share their data with the database 126.

ステップ２２０で、ユーザ固有の要約プログラム１２２はユーザに対して情報を要求する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介してユーザに対して情報を要求する。ユーザから要求される情報は、ユーザの嗜好に関する情報（例えば、ユーザ・コンピューティング・デバイス１３０に関する警告通知などの一般的なユーザ・システム設定）、警告通知の嗜好に関する情報（例えば、ユーザが離れていた会議に再接続するときに警告通知が送信される、または、ユーザが離れていた会議が完了するときに警告通知が送信される）、およびユーザ・プロフィールを作成するために必要な情報（例えば、各ユーザの企業プロフィール、ユーザの企業の階層、企業の階層内のユーザの地位、ユーザの企業内のユーザの年功序列、ユーザの主要な仕事、ユーザの主要な仕事の職務、ユーザの技術に関する関心、およびユーザの専門知識に関する情報）を含むが、これらに限定されない。実施形態では、ユーザ固有の要約プログラム１２２がオプトインするための要求をユーザから受信することに応答して、ユーザ固有の要約プログラム１２２はユーザに対して情報を要求する。 In step 220, the user-specific summarization program 122 requests information from the user. In an embodiment, the user-specific summarization program 122 requests information from the user via the user interface 132 of the user computing device 130. The information requested from the user includes, but is not limited to, information about the user's preferences (e.g., general user system settings, such as alert notifications, for the user computing device 130), information about alert notification preferences (e.g., alert notifications are sent when the user reconnects to a conference they left or alert notifications are sent when a conference they left completes), and information necessary to create a user profile (e.g., information about each user's company profile, the user's company hierarchy, the user's position within the company hierarchy, the user's seniority within the user's company, the user's primary job, the user's primary job function, the user's technology interests, and the user's expertise). In an embodiment, in response to the user-specific summarization program 122 receiving a request to opt in from the user, the user-specific summarization program 122 requests information from the user.

ステップ２３０で、ユーザ固有の要約プログラム１２２は、要求された情報をユーザから受信する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介して、要求された情報をユーザから受信する。実施形態では、ユーザ固有の要約プログラム１２２がユーザに対して情報を要求することに応答して、ユーザ固有の要約プログラム１２２は要求された情報をユーザから受信する。 In step 230, the user-specific summary program 122 receives the requested information from the user. In an embodiment, the user-specific summary program 122 receives the requested information from the user via the user interface 132 of the user computing device 130. In an embodiment, the user-specific summary program 122 receives the requested information from the user in response to the user-specific summary program 122 requesting information from the user.

ステップ２４０で、ユーザ固有の要約プログラム１２２はユーザ・プロフィールを作成する。実施形態では、ユーザ固有の要約プログラム１２２はユーザのユーザ・プロフィールを作成する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザに関する設定の間にユーザによって入力された情報に加えて、ユーザの嗜好およびユーザの警告通知の嗜好を含むユーザ・プロフィールを作成する。実施形態では、ユーザ固有の要約プログラム１２２が要求された情報をユーザから受信することに応答して、ユーザ固有の要約プログラム１２２はユーザ・プロフィールを作成する。 In step 240, the user-specific summary program 122 creates a user profile. In an embodiment, the user-specific summary program 122 creates a user profile for the user. In an embodiment, the user-specific summary program 122 creates a user profile that includes the user's preferences and the user's alert notification preferences in addition to information entered by the user during setup for the user. In an embodiment, in response to the user-specific summary program 122 receiving the requested information from the user, the user-specific summary program 122 creates the user profile.

ステップ２５０で、ユーザ固有の要約プログラム１２２はユーザ・プロフィールを格納する。実施形態では、ユーザ固有の要約プログラム１２２はユーザ・プロフィールをデータベース（例えば、データベース１２６）に格納する。実施形態では、ユーザ固有の要約プログラム１２２がユーザ・プロフィールを作成することに応答して、ユーザ固有の要約プログラム１２２はユーザ・プロフィールを格納する。 In step 250, the user-specific summary program 122 stores the user profile. In an embodiment, the user-specific summary program 122 stores the user profile in a database (e.g., database 126). In an embodiment, in response to the user-specific summary program 122 creating the user profile, the user-specific summary program 122 stores the user profile.

図３は、本発明の実施形態に従って、図１の分散データ処理環境１００内のユーザ固有の要約プログラム１２２の動作ステップを示す、概して３００と指定されたフローチャートである。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが仮想コラボレーション・サーバ・ツール上の会議から離れたときを検出することと、仮想コラボレーション・サーバ・ツール上の会議から離れているユーザのプロフィールに合わせて調整された、ユーザが切り離されていた間の会議の一部を対象にする、要約を準備することと、ユーザが会議に再接続するときにユーザに要約を提供することと、未来の反復でより調整された要約を準備することにおいてユーザ固有の要約プログラム１２２を支援するために、ユーザからフィードバックを収集することとを実行するように動作する。図３に示されたプロセスがプロセス・フローの１つの可能な反復を示しているということが理解されるべきであり、このプロセスは、ユーザが仮想コラボレーション・サーバ・ツール上でホストするか、参加するか、または出席する会議ごとに繰り返され得る。 FIG. 3 is a flowchart generally designated 300 illustrating operational steps of the user-specific summarization program 122 in the distributed data processing environment 100 of FIG. 1 in accordance with an embodiment of the present invention. In an embodiment, the user-specific summarization program 122 operates to detect when a user disconnects from a conference on the virtual collaboration server tool, prepare a summary tailored to the user's profile while disconnected from the conference on the virtual collaboration server tool and covering the portion of the conference while the user was disconnected, provide the summary to the user when the user reconnects to the conference, and collect feedback from the user to assist the user-specific summarization program 122 in preparing more tailored summaries in future iterations. It should be understood that the process depicted in FIG. 3 illustrates one possible iteration of the process flow, and this process may be repeated for each conference the user hosts, participates in, or attends on the virtual collaboration server tool.

ステップ３０５で、ユーザ固有の要約プログラム１２２は、ユーザが仮想コラボレーション・サーバ・ツール（例えば、ＣｉｓｃｏＷｅｂｅｘ（Ｒ）、Ｚｏｏｍ（Ｒ）、ＧｏｏｇｌｅＭｅｅｔ（Ｒ）、Ｍｉｃｒｏｓｏｆｔ（Ｒ）Ｔｅａｍｓなど）上の会議から離れたときを検出する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが、１人または複数のホストおよび２人以上の参加者が存在する仮想コラボレーション・サーバ・ツール上の会議から離れたときを検出する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが、ホストまたは参加者あるいはその両方が１つまたは複数のプレゼンテーション・ツール（例えば、Ｍｉｃｒｏｓｏｆｔ（Ｒ）ＰｏｗｅｒＰｏｉｎｔスライド、Ｍｉｃｒｏｓｏｆｔ（Ｒ）Ｅｘｃｅｌファイル、スプレッドシート、Ｗｅｂページ、ダイアグラム、フローチャートなど）を利用している仮想コラボレーション・サーバ・ツール上の会議から離れたときを検出する。 In step 305, the user-specific summarizing program 122 detects when the user leaves a conference on a virtual collaboration server tool (e.g., Cisco Webex®, Zoom®, Google Meet®, Microsoft® Teams, etc.). In an embodiment, the user-specific summarizing program 122 detects when the user leaves a conference on a virtual collaboration server tool in which one or more hosts and two or more participants are present. In an embodiment, the user-specific summarizing program 122 detects when the user leaves a conference on a virtual collaboration server tool in which the host and/or participants are utilizing one or more presentation tools (e.g., Microsoft® PowerPoint slides, Microsoft® Excel files, spreadsheets, web pages, diagrams, flowcharts, etc.).

実施形態では、ユーザ固有の要約プログラム１２２は、ユーザによって開始されたユーザの明示的な離脱（例えば、ユーザが状態を不在に変更する、ユーザが会議からサインアウトするなど）を検出する。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、技術的問題（例えば、ネットワーク接続の問題、デバイスの故障、停電など）によって引き起こされたユーザの暗黙的な離脱を検出する。 In an embodiment, the user-specific summary program 122 detects explicit user departures initiated by the user (e.g., the user changes their status to Away, the user signs out of the meeting, etc.). In one or more embodiments, the user-specific summary program 122 detects implicit user departures caused by technical issues (e.g., network connection issues, device failure, power outage, etc.).

第１の例では、コンピュータ科学の学生であるユーザＡが、仮想コラボレーション・サーバ・ツール上のユーザＡのオンライン仮想クラスに出席する。ある日、クラスに出席している間に、ユーザＡはネットワーク接続の問題を経験する。ユーザＡが経験するネットワーク接続の問題のため、ユーザＡは、ユーザＡのオンライン仮想クラスから切断される。ユーザ固有の要約プログラム１２２は、ユーザＡのオンライン仮想クラスからのユーザＡの暗黙的な離脱を検出する。 In a first example, User A, a computer science student, attends User A's online virtual class on a virtual collaboration server tool. One day, while attending class, User A experiences network connection problems. Due to the network connection problems experienced by User A, User A is disconnected from User A's online virtual class. The user-specific summary program 122 detects User A's implicit withdrawal from User A's online virtual class.

第２の例では、技術企業の従業員であるユーザＢが、自宅から作業する。ユーザＢは、仮想コラボレーション・サーバ・ツール上でユーザＢのチームと頻繁にチーム会議を開く。ユーザＢのチーム会議のうちの１つの間に、ユーザＢは、個人的な緊急事態に対処するためにチーム会議との接続を切る。ユーザ固有の要約プログラム１２２は、ユーザＢのチーム会議からのユーザＢの明示的な離脱を検出する。 In a second example, User B, an employee of a technology company, works from home. User B frequently holds team meetings with User B's team on a virtual collaboration server tool. During one of User B's team meetings, User B disconnects from the team meeting to attend to a personal emergency. The user-specific summary program 122 detects User B's explicit withdrawal from User B's team meeting.

第３の例では、都市Ｘの納税者であるユーザＣが、仮想コラボレーション・サーバ・ツール上の都市Ｘのオンライン市議会に出席する。ユーザＣが出席していた都市Ｘのオンライン市議会のうちの１つの間に、ユーザＣは、休憩するために会議との接続を切る。ユーザ固有の要約プログラム１２２は、都市Ｘのオンライン市議会からのユーザＣの明示的な離脱を検出する。 In a third example, User C, a taxpayer for City X, attends City X's online city council meetings on the virtual collaboration server tool. During one of the City X online city council meetings that User C is attending, User C disconnects from the meeting to take a break. The user-specific summary program 122 detects User C's explicit departure from City X's online city council meeting.

実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが、１０分より長いが３０分より短い事前設定された期間の間、仮想コラボレーション・サーバ・ツール上の会議から離れたときを検出する。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが、事前にスケジュールされた会議の割り当てられた合計時間の事前設定された割合の間、仮想コラボレーション・サーバ・ツール上の会議から離れたとき（例えば、ユーザが、事前にスケジュールされた会議の割り当てられた合計時間（例えば、６０分）の１０パーセント～２５パーセント（例えば、６～１５分）の間、離れた）かを検出する。 In an embodiment, the user-specific summary program 122 detects when a user has been away from a meeting on the virtual collaboration server tool for a preset period of time greater than 10 minutes but less than 30 minutes. In one or more embodiments, the user-specific summary program 122 detects when a user has been away from a meeting on the virtual collaboration server tool for a preset percentage of the pre-scheduled meeting's total allotted time (e.g., the user has been away for 10 to 25 percent (e.g., 6 to 15 minutes) of the pre-scheduled meeting's total allotted time (e.g., 60 minutes)).

実施形態では、ユーザ固有の要約プログラム１２２は、仮想コラボレーション・サーバ・ツール上の会議の開始時間を捕捉する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの離脱の開始時間（すなわち、ユーザが明示的または暗黙的に切断したときの会議中の時間）を捕捉する。 In an embodiment, the user-specific summary program 122 captures the start time of a conference on the virtual collaboration server tool. In an embodiment, the user-specific summary program 122 captures the start time of the user's departure (i.e., the time during the conference when the user explicitly or implicitly disconnects).

ステップ３１０で、ユーザ固有の要約プログラム１２２はデータを取り出す。実施形態では、ユーザ固有の要約プログラム１２２は、仮想コラボレーション・サーバ・ツール上の会議から離れているユーザのプロフィールに合わせて調整された、ユーザが切り離されていた間の会議の一部を対象にする要約を準備する目的で、データを取り出す。 In step 310, the user-specific summarization program 122 retrieves the data. In an embodiment, the user-specific summarization program 122 retrieves the data for the purpose of preparing a summary tailored to the profile of the user away from the conference on the virtual collaboration server tool, covering the portion of the conference while the user was disconnected.

実施形態では、ユーザ固有の要約プログラム１２２は、ユーザに関するデータを取り出す。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、会議の他の参加者に関するデータを取り出す。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、ユーザと会議の他の参加者の間の接続に関するデータを取り出す。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、参加者の特定の呼び出し、参加者によって尋ねられた質問、および参加者によって尋ねられた質問に対して提供された回答を含む、参加者間の対話に関するデータを取り出す。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、会議のトピックに関するデータを取り出す。 In embodiments, the user-specific summarization program 122 retrieves data about the user. In one or more embodiments, the user-specific summarization program 122 retrieves data about other participants in the conference. In one or more embodiments, the user-specific summarization program 122 retrieves data about connections between the user and other participants in the conference. In one or more embodiments, the user-specific summarization program 122 retrieves data about interactions between participants, including specific calls of participants, questions asked by participants, and answers provided to questions asked by participants. In one or more embodiments, the user-specific summarization program 122 retrieves data about the topic of the conference.

実施形態では、ユーザ固有の要約プログラム１２２は、ステップ２４０で作成されたユーザ・プロフィール、ユーザの企業プロフィール、ユーザのカレンダー、他の参加者のユーザ・プロフィール、他の参加者の企業プロフィール、他の参加者のカレンダー、ホストまたは参加者あるいはその両方によって使用される１つまたは複数のプレゼンテーション・ツール（例えば、Ｍｉｃｒｏｓｏｆｔ（Ｒ）ＰｏｗｅｒＰｏｉｎｔスライド、Ｍｉｃｒｏｓｏｆｔ（Ｒ）Ｅｘｃｅｌファイル、スプレッドシート、Ｗｅｂページ、ダイアグラム、フローチャートなど）、およびユーザがホストしたか、参加したか、または出席した以前の会議からデータベース（例えば、データベース１２６）に格納されたデータを含むが、これらに限定されないソースから、データを取り出す。データを取り出すユーザ固有の要約プログラム１２２の例は、本明細書では個別の方法を使用して説明されるが、ユーザ固有の要約プログラム１２２が上記の実施形態の１つまたは複数の組合せを介してデータを取り出してよいということに、注意するべきである。実施形態では、ユーザ固有の要約プログラム１２２が、ユーザが仮想コラボレーション・サーバ・ツール上の会議から離れたときを検出することに応答して、ユーザ固有の要約プログラム１２２はデータを取り出す。 In an embodiment, the user-specific summarization program 122 retrieves data from sources including, but not limited to, the user profile created in step 240, the user's company profile, the user's calendar, other participants' user profiles, other participants' company profiles, other participants' calendars, one or more presentation tools (e.g., Microsoft® PowerPoint slides, Microsoft® Excel files, spreadsheets, web pages, diagrams, flowcharts, etc.) used by the host and/or participants, and data stored in a database (e.g., database 126) from previous meetings hosted, participated in, or attended by the user. While examples of the user-specific summarization program 122 retrieving data are described herein using individual methods, it should be noted that the user-specific summarization program 122 may retrieve data via one or more combinations of the above embodiments. In an embodiment, the user-specific summarization program 122 retrieves data in response to the user-specific summarization program 122 detecting when the user has left the meeting on the virtual collaboration server tool.

ステップ３１５で、ユーザ固有の要約プログラム１２２は要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、仮想コラボレーション・サーバ・ツール上の会議から離れているユーザのプロフィールに合わせて調整された、ユーザが切り離されていた間の会議の一部を対象にする要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、（１）マージおよび（２）抽出テキスト要約アルゴリズムを使用する音声テキスト要約の概念を適用することによって、要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの離脱を検出した後に、本発明の実施形態は、（１）マージおよび（２）抽出テキスト要約アルゴリズムを使用する音声テキスト要約の概念を、ユーザが切り離されていた間の会議の一部から抽出された音声フレームまたはビデオ・フレームあるいはその両方に適用することによって、要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、（１）マージおよび（２）抽出テキスト要約アルゴリズムを使用する音声テキスト要約の概念を、ホストまたは参加者あるいはその両方によって使用される１つまたは複数のプレゼンテーション・ツール（例えば、Ｍｉｃｒｏｓｏｆｔ（Ｒ）ＰｏｗｅｒＰｏｉｎｔスライド、Ｍｉｃｒｏｓｏｆｔ（Ｒ）Ｅｘｃｅｌファイル、スプレッドシート、Ｗｅｂページ、ダイアグラム、フローチャートなど）に適用することによって、要約を準備する。 In step 315, the user-specific summarization program 122 prepares a summary. In an embodiment, the user-specific summarization program 122 prepares a summary tailored to the profile of the user who is away from the conference on the virtual collaboration server tool, covering the portion of the conference while the user was disconnected. In an embodiment, the user-specific summarization program 122 prepares the summary by applying concepts of speech-to-text summarization using (1) merging and (2) extracting text summarization algorithms. In an embodiment, after the user-specific summarization program 122 detects the user's departure, an embodiment of the present invention prepares the summary by applying concepts of speech-to-text summarization using (1) merging and (2) extracting text summarization algorithms to audio frames and/or video frames extracted from the portion of the conference while the user was disconnected. In an embodiment, the user-specific summarization program 122 prepares the summary by applying the concepts of speech-to-text summarization using (1) merging and (2) extractive text summarization algorithms to one or more presentation tools (e.g., Microsoft® PowerPoint slides, Microsoft® Excel files, spreadsheets, web pages, diagrams, flowcharts, etc.) used by the host and/or participants.

抽出テキスト要約アルゴリズムは、重要な文および他の際立った情報を特定の音声、ビデオ、またはテキスト・ファイル、あるいはその組合せから識別して抽出することによって適用される。テキスト・ファイルから抽出された文に重みが割り当てられ、文の特定の重みに基づいて文がランク付けされる。高くランク付けされた文が一緒にグループ化されて、簡潔な要約を形成する。別の方法で表すと、抽出テキスト要約アルゴリズムは、テキストの重要なセクションを識別して逐語的に生成し、元のテキストから文のサブセットを要約として生成することによって、適用される。 Extractive text summarization algorithms are applied by identifying and extracting important sentences and other salient information from a particular audio, video, or text file, or a combination thereof. Weights are assigned to the sentences extracted from the text file, and the sentences are ranked based on their particular weight. Highly ranked sentences are grouped together to form a concise summary. Stated another way, extractive text summarization algorithms are applied by identifying and generating verbatim important sections of text, and generating a subset of sentences from the original text as a summary.

ステップ３１５は、図４のフローチャート４００に関してさらに詳細に説明される。実施形態では、ユーザ固有の要約プログラム１２２がデータを取り出すことに応答して、ユーザ固有の要約プログラム１２２は要約を準備する。 Step 315 is described in further detail with respect to flowchart 400 of FIG. 4. In an embodiment, in response to the user-specific summary program 122 retrieving the data, the user-specific summary program 122 prepares a summary.

ステップ３２０で、ユーザ固有の要約プログラム１２２は、ユーザが仮想コラボレーション・サーバ・ツール上の会議に再接続するとき検出する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの再関与時間（すなわち、ユーザが再接続したときの会議中の時間）を捕捉する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの離脱期間の全持続時間（すなわち、ユーザが会議から切り離されていた時間の全持続時間）を計算する。ユーザ固有の要約プログラム１２２が、ユーザの離脱期間の全持続時間が、事前設定された期間、または事前にスケジュールされた会議の割り当てられた合計時間の事前設定された割合より長いということを決定した場合、ユーザ固有の要約プログラム１２２は要約をユーザに出力しない。実施形態では、ユーザ固有の要約プログラム１２２が要約を準備することに応答して、ユーザ固有の要約プログラム１２２は、ユーザが仮想コラボレーション・サーバ・ツール上の会議に再接続するとき検出する。 In step 320, the user-specific summary program 122 detects when the user reconnects to the conference on the virtual collaboration server tool. In an embodiment, the user-specific summary program 122 captures the user's re-engagement time (i.e., the time during the conference when the user reconnects). In an embodiment, the user-specific summary program 122 calculates the total duration of the user's disengagement period (i.e., the total duration of time the user was disconnected from the conference). If the user-specific summary program 122 determines that the total duration of the user's disengagement period is longer than a preset period or a preset percentage of the pre-scheduled conference's total allotted time, the user-specific summary program 122 does not output a summary to the user. In an embodiment, in response to the user-specific summary program 122 preparing a summary, the user-specific summary program 122 detects when the user reconnects to the conference on the virtual collaboration server tool.

第１の例では、ユーザＡは、ユーザＡのネットワーク接続の問題を解決した後に、切断から１０分以内にユーザＡのオンライン仮想クラスに再接続する。ユーザ固有の要約プログラム１２２は、再接続しているユーザＡを検出し、ユーザＡの再関与時間を捕捉し、ユーザＡのオンライン仮想クラスからのユーザＡの離脱期間の全持続時間を、１０分であると計算する。 In a first example, User A reconnects to User A's online virtual class within 10 minutes of disconnection after resolving User A's network connection issues. The user-specific summary program 122 detects User A reconnecting, captures User A's re-engagement time, and calculates the total duration of User A's disengagement period from User A's online virtual class to be 10 minutes.

第２の例では、ユーザＢは、ユーザＢの個人的な緊急事態に対処した後に、切断から１５分以内にユーザＢのチーム会議に再接続する。ユーザ固有の要約プログラム１２２は、再接続しているユーザＢを検出し、ユーザＢの再関与時間を捕捉し、ユーザＢのチーム会議からのユーザＢの離脱期間の全持続時間を、１５分であると計算する。 In a second example, User B reconnects to User B's team conference within 15 minutes of disconnection after attending to User B's personal emergency. The user-specific summary program 122 detects User B reconnecting, captures User B's re-engagement time, and calculates the total duration of User B's withdrawal period from User B's team conference to be 15 minutes.

第３の例では、ユーザＣは、休憩した後に、切断から１０分以内に都市Ｘのオンライン市議会に再接続する。ユーザ固有の要約プログラム１２２は、再接続しているユーザＣを検出し、ユーザＣの再関与時間を捕捉し、都市Ｘのオンライン市議会からのユーザＣの離脱の全持続時間を、１０分であると計算する。 In a third example, User C takes a break and then reconnects to City X's online city council within 10 minutes of disconnection. The user-specific summary program 122 detects User C reconnecting, captures User C's re-engagement time, and calculates the total duration of User C's withdrawal from City X's online city council to be 10 minutes.

判定３２５で、ユーザ固有の要約プログラム１２２は、ユーザが会議に再び参加する前に要約をレビューするかどうかを判定する。実施形態では、ユーザ固有の要約プログラム１２２は、ステップ２２０でユーザが設定したユーザの嗜好に基づいて、ユーザが会議に再び参加する前に要約をレビューするかどうかを判定する。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの決定に基づいて、ユーザが会議に再び参加する前に要約をレビューするかどうかを判定する。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、機械駆動の推奨に基づいて、ユーザが会議に再び参加する前に要約をレビューするかどうかを判定する。例えば、再び参加する前に、ユーザの離脱期間中にユーザが聞きもらした情報は、再び参加する前に知る必要があるため、ユーザ固有の要約プログラム１２２は、ユーザが再び参加する前に要約を読むことを推奨する。ユーザが会議に再び参加する前に要約をレビューするかどうかを判定するユーザ固有の要約プログラム１２２の例は、本明細書では個別の方法を使用して説明されるが、ユーザ固有の要約プログラム１２２が上記の実施形態の１つまたは複数の組合せを介してユーザが会議に再び参加する前に要約をレビューするかどうかを判定してよいということに、注意するべきである。実施形態では、ユーザ固有の要約プログラム１２２が、ユーザが仮想コラボレーション・サーバ・ツール上の会議に再接続するとき検出することに応答して、ユーザ固有の要約プログラム１２２は、ユーザが会議に再び参加する前に要約をレビューするかどうかを判定する。 At decision 325, the user-specific summarizing program 122 determines whether the user will review the summary before rejoining the conference. In embodiments, the user-specific summarizing program 122 determines whether the user will review the summary before rejoining the conference based on the user preferences set by the user in step 220. In one or more embodiments, the user-specific summarizing program 122 determines whether the user will review the summary before rejoining the conference based on a user decision. In one or more embodiments, the user-specific summarizing program 122 determines whether the user will review the summary before rejoining the conference based on a machine-driven recommendation. For example, the user-specific summarizing program 122 recommends that the user read the summary before rejoining because any information the user missed during the user's absence needs to be known before rejoining. Although examples of the user-specific summarizing program 122 determining whether the user will review the summary before rejoining the conference are described herein using individual methods, it should be noted that the user-specific summarizing program 122 may determine whether the user will review the summary before rejoining the conference via one or more combinations of the above embodiments. In an embodiment, in response to the user-specific summary program 122 detecting when the user reconnects to the conference on the virtual collaboration server tool, the user-specific summary program 122 determines whether the user should review the summary before rejoining the conference.

ユーザ固有の要約プログラム１２２が、ユーザが会議に再び参加する前に要約をレビューするということを決定した場合（判定３２５の「はい」の分岐）、ユーザ固有の要約プログラム１２２は、ユーザが要約をレビューすることをどの程度好むか（すなわち、ユーザの嗜好）を選択するようユーザに促す（ステップ３３０）。ユーザ固有の要約プログラム１２２が、ユーザが会議に再び参加する前に要約をレビューしないということを決定した場合（判定３２５の「いいえ」の分岐）、ユーザ固有の要約プログラム１２２は、ユーザが要約をレビューすることをどの程度好むかを選択するようユーザに促す前に、会議が完了したかどうかを判定する（判定３６０）。 If the user-specific summarization program 122 determines that the user will review the summary before rejoining the conference (the "Yes" branch of decision 325), the user-specific summarization program 122 prompts the user to select how much the user prefers to review the summary (i.e., the user's preference) (step 330). If the user-specific summarization program 122 determines that the user will not review the summary before rejoining the conference (the "No" branch of decision 325), the user-specific summarization program 122 determines whether the conference has completed before prompting the user to select how much the user prefers to review the summary (decision 360).

判定３６０で、ユーザ固有の要約プログラム１２２は、会議が完了したかどうかを判定する。実施形態では、ユーザ固有の要約プログラム１２２は、ホストが会議を終わらせたときに、会議が完了したということを決定する。実施形態では、ユーザ固有の要約プログラム１２２が、ユーザが会議に再び参加する前に要約をレビューしないということを決定することに応答して、ユーザ固有の要約プログラム１２２は、会議が完了したかどうかを判定する。 At decision 360, the user-specific summary program 122 determines whether the conference is complete. In an embodiment, the user-specific summary program 122 determines that the conference is complete when the host ends the conference. In an embodiment, in response to the user-specific summary program 122 determining that the user will not review the summary before rejoining the conference, the user-specific summary program 122 determines whether the conference is complete.

ユーザ固有の要約プログラム１２２が、会議が完了したということを決定した場合（判定３６０の「はい」の分岐）、ユーザ固有の要約プログラム１２２は、ユーザが要約をレビューすることをどの程度好むかを選択するようユーザに促す（ステップ３３０）。ユーザ固有の要約プログラム１２２が、会議が完了していないということを決定した場合（判定３６０の「いいえ」の分岐）、ユーザ固有の要約プログラム１２２は、ステップ３３０に進む前に、会議が完了するまで待機する。 If the user-specific summary program 122 determines that the conference is complete (the "Yes" branch of decision 360), the user-specific summary program 122 prompts the user to select how much the user prefers to review the summary (step 330). If the user-specific summary program 122 determines that the conference is not complete (the "No" branch of decision 360), the user-specific summary program 122 waits until the conference is complete before proceeding to step 330.

例では、仮想コラボレーション・サーバ・ツール上の都市Ｘのオンライン市議会に出席している、都市Ｘの納税者であるユーザＣが、都市Ｘのオンライン市議会に再接続する。ユーザ固有の要約プログラム１２２は、再接続しているユーザＣを検出する。ユーザ固有の要約プログラム１２２は、会議に再び参加することからユーザＣを遅延させる代わりに、ユーザＣが、都市Ｘのオンライン市議会が完了するまで待機して、要約をレビューするという、機械駆動の推奨を作成する。ユーザ固有の要約プログラム１２２は、会議が完了するまで、ユーザＣが要約をレビューすることをどの程度好むかを選択するようユーザＣに促さない。 In the example, User C, a taxpayer for City X who is attending an online city council meeting for City X on the virtual collaboration server tool, reconnects to the online city council meeting for City X. The user-specific summarizing program 122 detects the reconnecting User C. Instead of delaying User C from rejoining the meeting, the user-specific summarizing program 122 makes a machine-driven recommendation that User C wait until the online city council meeting for City X is complete to review the summary. The user-specific summarizing program 122 does not prompt User C to select how much User C prefers to review the summary until the meeting is complete.

判定３２５に戻り、ユーザ固有の要約プログラム１２２が、ユーザが会議に再び参加する前に要約をレビューするということを決定した場合（判定３２５の「はい」の分岐）、ユーザ固有の要約プログラム１２２は、ステップ３３０に進み、ユーザがどのように要約をレビューすることを好むかを選択するようユーザに促す。ステップ３３０で、ユーザ固有の要約プログラム１２２は、ユーザがどのように要約をレビューすることを好むかを選択するようユーザに促す。実施形態では、ユーザ固有の要約プログラム１２２は、言語、視聴モード、音声、およびサブタイトルを含むが、これに限定されない設定のデフォルトの選択をユーザに促す。デフォルトの選択は、推奨される経験（例えば、英語（米語）、高帯域ビデオ視聴モード、音声のオン、およびサブタイトルのオフ）をユーザに提供し、ステップ２２０でユーザによって設定されたユーザの嗜好に基づく。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの作業環境により適した選択肢を選択するために、デフォルトの選択を変更するようユーザに促す。例えば、ユーザは、デフォルトの言語設定を代替の言語（例えば、ドイツ語、英語（米語）、スペイン語（ラテンアメリカ）、フランス語、フランス語（カナダ）、イタリア語、ポーランド語、ポルトガル語、ポルトガル語（ブラジル）など）に変更するか、デフォルトの視聴モードを代替の視聴モード（例えば、高帯域ビデオ、低帯域ビデオ、テキスト、およびグラフィックス）に変更するか、デフォルトの音声設定（例えば、オンまたはオフ）を変更するか、またはデフォルトのサブタイトル設定（例えば、オンまたはオフ）を変更するか、あるいはその組合せを実行してよい。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介して、ユーザがどのように要約をレビューすることを好むかを選択するようユーザに促す。実施形態では、ユーザ固有の要約プログラム１２２が、ユーザが要約をレビューするということを決定することに応答して、ユーザ固有の要約プログラム１２２は、ユーザがどのように要約をレビューすることを好むかを選択するようユーザに促す。 Returning to decision 325, if the user-specific summarization program 122 determines that the user will review the summary before rejoining the conference (the "Yes" branch of decision 325), the user-specific summarization program 122 proceeds to step 330 and prompts the user to select how the user prefers to review the summary. In step 330, the user-specific summarization program 122 prompts the user to select how the user prefers to review the summary. In an embodiment, the user-specific summarization program 122 prompts the user for default selections of settings, including, but not limited to, language, viewing mode, audio, and subtitles. The default selections provide the user with a recommended experience (e.g., English (American), high-bandwidth video viewing mode, audio on, and subtitles off) and are based on the user preferences set by the user in step 220. In an embodiment, the user-specific summarization program 122 prompts the user to change the default selections to select options that are more suitable for the user's work environment. For example, the user may change the default language setting to an alternate language (e.g., German, English (US), Spanish (Latin America), French, French (Canadian), Italian, Polish, Portuguese, Portuguese (Brazilian), etc.), change the default viewing mode to an alternate viewing mode (e.g., high bandwidth video, low bandwidth video, text, and graphics), change the default audio setting (e.g., on or off), or change the default subtitle setting (e.g., on or off), or any combination thereof. In an embodiment, user-specific summarization program 122 prompts the user, via user interface 132 of user computing device 130, to select how the user prefers to review the summary. In an embodiment, in response to user-specific summarization program 122 determining that the user will review the summary, user-specific summarization program 122 prompts the user to select how the user prefers to review the summary.

ステップ３３５で、ユーザ固有の要約プログラム１２２は要約を出力する。実施形態では、ユーザ固有の要約プログラム１２２は、ステップ３３０でユーザによって選択された形式で要約を出力する。実施形態では、ユーザ固有の要約プログラム１２２は、要約を警告通知として出力する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介して、要約をユーザに出力する。実施形態では、ユーザ固有の要約プログラム１２２が、ユーザがどのように要約をレビューすることを好むかを選択するようユーザに促すことに応答して、ユーザ固有の要約プログラム１２２は要約を出力する。 In step 335, the user-specific summary program 122 outputs the summary. In an embodiment, the user-specific summary program 122 outputs the summary in the format selected by the user in step 330. In an embodiment, the user-specific summary program 122 outputs the summary as an alert notification. In an embodiment, the user-specific summary program 122 outputs the summary to the user via the user interface 132 of the user computing device 130. In an embodiment, the user-specific summary program 122 outputs the summary in response to the user-specific summary program 122 prompting the user to select how the user prefers to review the summary.

第１の例では、仮想コラボレーション・サーバ・ツール上のオンライン仮想クラスに出席し、ネットワーク接続の問題を経験していたコンピュータ科学の学生であるユーザＡが、ユーザＡのオンライン仮想クラスに再接続する。ユーザ固有の要約プログラム１２２は、再接続しているユーザＡを検出し、ユーザＡがクラスに再び参加する前に要約をレビューすることを好むか、またはクラスが完了するまで待機して、要約をレビューするかを尋ねる。ユーザＡは、ユーザＡのオンライン仮想クラスに再び参加する前に要約をレビューすることを選択する。ユーザＡは、デフォルトの選択を使用して要約をレビューすることを選択し、小会議室に移動される。ユーザＡは、音声付きの３０秒のビデオを受信して、見る。このときユーザＡは、ユーザＡが切り離されていた間に聞きもらしたことに関する最新情報を取得し、ユーザＡのオンライン仮想クラスに再び参加する準備ができる。 In a first example, User A, a computer science student who was attending an online virtual class on a virtual collaboration server tool and experiencing network connection issues, reconnects to User A's online virtual class. The user-specific summary program 122 detects User A reconnecting and asks whether User A would prefer to review the summary before rejoining the class or to wait until the class is complete to review the summary. User A chooses to review the summary before rejoining User A's online virtual class. User A chooses to review the summary using the default selection and is taken to a breakout room. User A receives and views a 30-second video with audio. User A now has an update on what User A missed while disconnected and is ready to rejoin User A's online virtual class.

第２の例では、自宅から作業し、仮想コラボレーション・サーバ・ツール上でユーザＢのチームとのチーム会議に参加していた、技術企業の従業員であるユーザＢが、ユーザＢのチーム会議に再接続する。ユーザ固有の要約プログラム１２２は、再接続しているユーザＢを検出し、ユーザＢがチーム会議に再び参加する前に要約をレビューすることを好むか、またはチーム会議が完了するまで待機して、要約をレビューするかを尋ねる。ユーザＢは、チーム会議に再び参加する前に要約をレビューすることを選択する。ユーザＢは、デフォルトの音声設定を「オフ」に変更し、デフォルトのサブタイトル設定を「オン」に変更する。ユーザＢは、現在の会議室にとどまって要約をレビューする。ユーザＢは、サブタイトル付きの３０秒のビデオを受信して、見る。このときユーザＢは、ユーザＢが切り離されていた間に聞きもらしたことに関する最新情報を取得し、ユーザＢのチーム会議に再び参加する準備ができる。 In a second example, User B, an employee of a technology company who works from home and was participating in a team conference with User B's team on a virtual collaboration server tool, reconnects to User B's team conference. The user-specific summarization program 122 detects User B reconnecting and asks if User B prefers to review the summary before rejoining the team conference or waits until the team conference is complete to review the summary. User B chooses to review the summary before rejoining the team conference. User B changes the default audio setting to "off" and the default subtitle setting to "on." User B remains in the current conference room and reviews the summary. User B receives and watches a 30-second video with subtitles. User B now has an update on what he missed while he was disconnected and is ready to rejoin User B's team conference.

第３の例では、都市Ｘのオンライン市議会が完了する。ユーザ固有の要約プログラム１２２は、要約をユーザＣに出力する。ユーザＣは、音声付きの３０秒のビデオを受信して、見る。このときユーザＣは、ユーザＣが都市Ｘのオンライン市議会から切り離されていた間に聞きもらしたことに関する最新情報を取得する。 In a third example, an online city meeting for City X is completed. The user-specific summary program 122 outputs a summary to User C, who receives and views a 30-second video with audio. User C now has an update on what he missed while he was disconnected from City X's online city meeting.

ステップ３４０で、ユーザ固有の要約プログラム１２２は、要約を会議の完全な記録の再生と比較する。実施形態では、ユーザ固有の要約プログラム１２２は、主要な実体を、要約から、および会議の完全な記録の再生から抽出する。実施形態では、ユーザ固有の要約プログラム１２２は、要約から、および会議の完全な記録の再生から抽出された、類似している主要な実体を照合する。実施形態では、ユーザ固有の要約プログラム１２２は、要約および会議の意味が同じであることを保証するために、要約を、会議の完全な記録の再生と比較する。実施形態では、ユーザ固有の要約プログラム１２２が要約を出力することに応答して、ユーザ固有の要約プログラム１２２は、要約を会議の完全な記録の再生と比較する。 In step 340, the user-specific summarization program 122 compares the summary with a playback of the full recording of the meeting. In an embodiment, the user-specific summarization program 122 extracts key entities from the summary and from a playback of the full recording of the meeting. In an embodiment, the user-specific summarization program 122 matches similar key entities extracted from the summary and from a playback of the full recording of the meeting. In an embodiment, the user-specific summarization program 122 compares the summary with a playback of the full recording of the meeting to ensure that the meaning of the summary and the meeting are the same. In an embodiment, in response to the user-specific summarization program 122 outputting the summary, the user-specific summarization program 122 compares the summary with a playback of the full recording of the meeting.

ステップ３４５で、ユーザ固有の要約プログラム１２２はユーザに対してフィードバックを要求する。実施形態では、ユーザ固有の要約プログラム１２２は、ステップ３３５での要約の出力に関して、ユーザに対してフィードバックを要求する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが１つのみを選択できる３つの選択肢（すなわち、－１、０、＋１）をユーザに提供する。ユーザに提供される３つの選択肢は、ユーザのあり得る満足度（すなわち、それぞれ不満足、中立、満足）を表す。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介してユーザに対してフィードバックを要求する。実施形態では、要約を会議の完全な記録の再生と比較することに応答して、ユーザ固有の要約プログラム１２２はユーザに対してフィードバックを要求する。 In step 345, the user-specific summarization program 122 requests feedback from the user. In an embodiment, the user-specific summarization program 122 requests feedback from the user regarding the output of the summary in step 335. In an embodiment, the user-specific summarization program 122 provides the user with three options (i.e., -1, 0, +1) from which the user can select only one. The three options provided to the user represent the user's possible satisfaction levels (i.e., dissatisfied, neutral, and satisfied, respectively). In an embodiment, the user-specific summarization program 122 requests feedback from the user via the user interface 132 of the user computing device 130. In an embodiment, in response to comparing the summary with a playback of the full recording of the conference, the user-specific summarization program 122 requests feedback from the user.

ステップ３５０で、ユーザ固有の要約プログラム１２２はユーザからフィードバックを受信する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介してユーザからフィードバックを受信する。実施形態では、ユーザ固有の要約プログラム１２２は、フィードバックを使用して、プロセスの未来の反復でより調整された要約を準備することにおいてユーザ固有の要約プログラム１２２を改良する。実施形態では、ユーザ固有の要約プログラム１２２は、強化学習を使用してユーザ固有の要約プログラム１２２を改良する。実施形態では、ユーザ固有の要約プログラム１２２は、ＲｅＬＵまたはリーキーＲｅＬＵを活性化関数として使用して、精度を向上させる。実施形態では、ユーザ固有の要約プログラム１２２は、音声フレームまたはビデオ・フレームあるいはその両方に対して音声テキスト（ＳＴＴ：speech-to-text）認識を実行して、三つ組(triples)を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、上位Ｎ個のランク付けされ文をエンド・ユーザに勧める。実施形態では、ユーザ固有の要約プログラム１２２は、会議サマライザ（meeting summarizer）を強化学習エージェントとして使用し、協調プロセスの間のマルコフ決定プロセスの後に、反復プロセスをモデル化する。実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズムの実行時間を加速するために、状態とアクション（Ｓ－Ａ：State-Action）の対に対してＱ学習技術を実行する。Ｑ学習技術は、強化学習（ＲＬ：reinforcement learning）エージェントのカンニング・ペーパー(crib sheet)として機能する。Ｑ学習技術は、ＲＬエージェントが環境からのフィードバックを使用して、さまざまな環境内で実行できる最良のアクションを学習することを可能にする。Ｑ学習技術は、Ｑ値も使用して、ＲＬエージェントの性能を追跡し、改善する。最初に、Ｑ値は任意の値に設定される。しかし、ＲＬエージェントがさまざまなアクションを実行し、アクションに対するフィードバック（すなわち、不満足、中立、満足）を受信するときに、Ｑ値が更新される。実施形態では、ユーザ固有の要約プログラム１２２は、時間が増加するにつれて報酬が増えるように、強化学習システムを設計する。エピソードごとの報酬の最大値は、総報酬を最大化することによってＲＬエージェントが正しいアクションを実行することを学習したことを示す。実施形態では、ユーザ固有の要約プログラム１２２がユーザに対してフィードバックを要求することに応答して、ユーザ固有の要約プログラム１２２はフィードバックをユーザから受信する。 In step 350, the user-specific summarization program 122 receives feedback from the user. In an embodiment, the user-specific summarization program 122 receives feedback from the user via the user interface 132 of the user computing device 130. In an embodiment, the user-specific summarization program 122 uses the feedback to improve the user-specific summarization program 122 in preparing more tailored summaries in future iterations of the process. In an embodiment, the user-specific summarization program 122 uses reinforcement learning to improve the user-specific summarization program 122. In an embodiment, the user-specific summarization program 122 uses ReLU or leaky ReLU as an activation function to improve accuracy. In an embodiment, the user-specific summarization program 122 performs speech-to-text (STT) recognition on the audio frames and/or video frames to prepare triplets. In an embodiment, the user-specific summarization program 122 recommends the top N ranked sentences to the end user. In an embodiment, the user-specific summarizer 122 uses a meeting summarizer as a reinforcement learning agent to model an iterative process after a Markov decision process during a collaboration process. In an embodiment, the user-specific summarizer 122 performs Q-learning techniques on state-action (SA) pairs to accelerate the execution time of the algorithm. The Q-learning technique serves as a crib sheet for the reinforcement learning (RL) agent. The Q-learning technique allows the RL agent to use feedback from the environment to learn the best actions it can perform in various environments. The Q-learning technique also uses a Q-value to track and improve the performance of the RL agent. Initially, the Q-value is set to an arbitrary value. However, as the RL agent performs various actions and receives feedback on the actions (i.e., dissatisfied, neutral, satisfied), the Q-value is updated. In an embodiment, the user-specific summarizer 122 designs the reinforcement learning system so that rewards increase over time. The maximum per-episode reward indicates that the RL agent has learned to perform the correct action by maximizing the total reward. In an embodiment, in response to the user-specific summary program 122 requesting feedback from the user, the user-specific summary program 122 receives feedback from the user.

ステップ３５５で、ユーザ固有の要約プログラム１２２はフィードバックを格納する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザから受信されたフィードバックを格納する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザから受信されたフィードバックをデータベース（例えば、データベース１２６）に格納する。実施形態では、ユーザ固有の要約プログラム１２２がフィードバックをユーザから受信することに応答して、ユーザ固有の要約プログラム１２２はフィードバックを格納する。 In step 355, the user-specific summary program 122 stores the feedback. In an embodiment, the user-specific summary program 122 stores the feedback received from the user. In an embodiment, the user-specific summary program 122 stores the feedback received from the user in a database (e.g., database 126). In an embodiment, in response to the user-specific summary program 122 receiving the feedback from the user, the user-specific summary program 122 stores the feedback.

一部の実施形態では、ユーザ固有の要約プログラム１２２は、任意選択的なステップとして、ステップ３４５、３５０、および３５５を実行してよい。 In some embodiments, the user-specific summary program 122 may perform steps 345, 350, and 355 as optional steps.

図４は、本発明の実施形態に従って、図１の分散データ処理環境１００内のユーザ固有の要約プログラム１２２の機械学習コンポーネント１２４の動作ステップを示す、概して４００と指定されたフローチャートである。実施形態では、ユーザ固有の要約プログラム１２２は、マージ、抽出要約、話者ダイアライゼーション、および機械学習の概念を適用することによって、仮想コラボレーション・サーバ・ツール上の会議から離れているユーザのプロフィールに合わせて調整された、ユーザが切り離されていた間の会議の一部を対象にする要約を準備するように動作する。実施形態では、ユーザ固有の要約プログラム１２２の機械学習コンポーネント１２４は、ユーザが仮想コラボレーション・サーバ・ツール上でホストするか、参加するか、または出席する会議の全持続時間の間、継続的に稼働する。図４に示されたプロセスがプロセス・フローの１つの可能な反復を示しているということが理解されるべきである。 FIG. 4 is a flowchart generally designated 400 illustrating operational steps of the machine learning component 124 of the user-specific summarization program 122 in the distributed data processing environment 100 of FIG. 1 in accordance with an embodiment of the present invention. In an embodiment, the user-specific summarization program 122 operates to prepare summaries tailored to the profile of a user away from a conference on a virtual collaboration server tool, covering the portion of the conference while the user was disconnected, by applying concepts of merging, extractive summarization, speaker diarization, and machine learning. In an embodiment, the machine learning component 124 of the user-specific summarization program 122 runs continuously for the entire duration of a conference that the user hosts, participates in, or attends on the virtual collaboration server tool. It should be understood that the process illustrated in FIG. 4 illustrates one possible iteration of the process flow.

ステップ４０５で、ユーザ固有の要約プログラム１２２は、会議の２人以上の参加者を識別する。実施形態では、ユーザ固有の要約プログラム１２２は、仮想コラボレーション・サーバ・ツールから（例えば、参加者リストから、または出席報告から）収集された情報を介して、会議の２人以上の参加者を識別する。実施形態では、ユーザ固有の要約プログラム１２２は、重み付けを会議の２人以上の参加者に割り当てる。実施形態では、ユーザ固有の要約プログラム１２２は、要因のセットに基づいて、重み付けを会議の２人以上の参加者に割り当てる。要因のセットは、会議における参加者の役割（すなわち、会議の主催者、会議の要求された出席者、会議の任意選択的な出席者など）、参加者の企業における参加者の役割、および会議から離れたユーザとの参加者の関連性を含むが、これらに限定されない。実施形態では、ユーザ固有の要約プログラム１２２は、参加者の割り当てられた重み付けに基づいて、会議の２人以上の参加者をランク付けする（すなわち、ユーザに最も関連する参加者からユーザに最も関連しない参加者まで、参加者をランク付けする）。実施形態では、ユーザ固有の要約プログラム１２２は、２人以上の参加者のそのようなランク付けをくつがえす(override)能力をユーザに提供する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザ・コンピューティング・デバイス１３０のユーザ・インターフェイス１３２を介して、そのようなランク付けをくつがえす能力をユーザに提供する。 In step 405, the user-specific summarization program 122 identifies two or more participants of the conference. In an embodiment, the user-specific summarization program 122 identifies two or more participants of the conference via information collected from the virtual collaboration server tool (e.g., from a participant list or from an attendance report). In an embodiment, the user-specific summarization program 122 assigns weights to two or more participants of the conference. In an embodiment, the user-specific summarization program 122 assigns weights to two or more participants of the conference based on a set of factors. The set of factors includes, but is not limited to, the participant's role in the conference (i.e., the conference organizer, the requested attendee of the conference, the optional attendee of the conference, etc.), the participant's role in the participant's enterprise, and the participant's relevance to the user away from the conference. In an embodiment, the user-specific summarization program 122 ranks the two or more participants of the conference based on the participants' assigned weights (i.e., ranks the participants from most relevant to the user to least relevant to the user). In an embodiment, the user-specific summarization program 122 provides the user with the ability to override such rankings of two or more participants. In an embodiment, the user-specific summarization program 122 provides the user with the ability to override such rankings via the user interface 132 of the user computing device 130.

ステップ４１０で、ユーザ固有の要約プログラム１２２は、複数の音声フレームまたはビデオ・フレームあるいはその両方を抽出する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが切り離されていた会議の一部から、複数の音声フレームまたはビデオ・フレームあるいはその両方を抽出する。実施形態では、ユーザ固有の要約プログラム１２２は、要約が準備されて、会議から離れていたユーザに出力されるまで、複数の音声フレームまたはビデオ・フレームあるいはその両方を継続的に抽出する。 In step 410, the user-specific summary program 122 extracts multiple audio and/or video frames. In an embodiment, the user-specific summary program 122 extracts multiple audio and/or video frames from the portion of the conference from which the user was disconnected. In an embodiment, the user-specific summary program 122 continuously extracts multiple audio and/or video frames until a summary is prepared and output to the user who was disconnected from the conference.

実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが切り離されていた会議の一部から抽出された音声フレームまたはビデオ・フレームあるいはその両方の数を数える。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの名前を含んでいる音声フレームまたはビデオ・フレームあるいはその両方（例えば、ユーザが話すことまたは質問に回答することが期待されたため、会議のホストによってユーザの名前が言及されていた間の音声フレームもしくはビデオ・フレームまたはその両方、あるいは小規模なブレークアウトグループに参加するように会議のホストによってユーザの名前が呼ばれていた間の音声フレームもしくはビデオ・フレームまたはその両方）の数を数える。実施形態では、ユーザ固有の要約プログラム１２２が会議の２人以上の参加者を識別することに応答して、ユーザ固有の要約プログラム１２２は、複数の音声フレームまたはビデオ・フレームあるいはその両方を抽出する。 In an embodiment, the user-specific summarization program 122 counts the number of audio and/or video frames extracted from the portion of the conference from which the user was disconnected. In an embodiment, the user-specific summarization program 122 counts the number of audio and/or video frames containing the user's name (e.g., audio and/or video frames during which the user's name was mentioned by the conference host because the user was expected to speak or answer a question, or audio and/or video frames during which the user's name was called by the conference host to join a smaller breakout group). In an embodiment, in response to the user-specific summarization program 122 identifying two or more participants in the conference, the user-specific summarization program 122 extracts multiple audio and/or video frames.

ステップ４１５で、ユーザ固有の要約プログラム１２２は、複数の音声フレームまたはビデオ・フレームあるいはその両方の文脈を識別する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザが切り離されていた会議の一部から抽出された複数の音声フレームまたはビデオ・フレームあるいはその両方の文脈を識別する。実施形態では、ユーザ固有の要約プログラム１２２は、会議の意図またはトピックあるいはその両方を理解するために、複数の音声フレームまたはビデオ・フレームあるいはその両方の文脈を識別する。実施形態では、ユーザ固有の要約プログラム１２２が複数の音声フレームまたはビデオ・フレームあるいはその両方を抽出することに応答して、ユーザ固有の要約プログラム１２２は、複数の音声フレームまたはビデオ・フレームあるいはその両方の文脈を識別する。 In step 415, the user-specific summarization program 122 identifies the context of the multiple audio frames and/or video frames. In an embodiment, the user-specific summarization program 122 identifies the context of the multiple audio frames and/or video frames extracted from a portion of the conference from which the user was detached. In an embodiment, the user-specific summarization program 122 identifies the context of the multiple audio frames and/or video frames to understand the intent and/or topic of the conference. In an embodiment, in response to the user-specific summarization program 122 extracting the multiple audio frames and/or video frames, the user-specific summarization program 122 identifies the context of the multiple audio frames and/or video frames.

ステップ４２０で、ユーザ固有の要約プログラム１２２は、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットを選択する。実施形態では、ユーザ固有の要約プログラム１２２は、音声フレームまたはビデオ・フレームあるいはその両方が、会議から離れていたユーザのための要約の準備に寄与するかどうかに基づいて、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットを選択する。実施形態では、ユーザ固有の要約プログラム１２２が複数の音声フレームまたはビデオ・フレームあるいはその両方の文脈を識別することに応答して、ユーザ固有の要約プログラム１２２は、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットを選択する。 In step 420, the user-specific summarization program 122 selects a subset of the plurality of audio frames and/or video frames. In an embodiment, the user-specific summarization program 122 selects a subset of the plurality of audio frames and/or video frames based on whether the audio frames and/or video frames contribute to preparing a summary for a user who was away from the conference. In an embodiment, in response to the user-specific summarization program 122 identifying the context of the plurality of audio frames and/or video frames, the user-specific summarization program 122 selects a subset of the plurality of audio frames and/or video frames.

ステップ４２５で、ユーザ固有の要約プログラム１２２は、ステップ４２０で選択された複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。実施形態では、ユーザ固有の要約プログラム１２２は、複数の音声フレームのサブセットをランク付けする。別の実施形態では、ユーザ固有の要約プログラム１２２は、複数のビデオ・フレームのサブセットをランク付けする。別の実施形態では、ユーザ固有の要約プログラム１２２は、複数の音声およびビデオ・フレームのサブセットをランク付けする。 In step 425, the user-specific summarization program 122 ranks the subset of audio frames and/or video frames selected in step 420. In an embodiment, the user-specific summarization program 122 ranks the subset of audio frames. In another embodiment, the user-specific summarization program 122 ranks the subset of video frames. In another embodiment, the user-specific summarization program 122 ranks the subset of audio and video frames.

実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズム法を使用して複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、音声フレームまたはビデオ・フレームあるいはその両方が、会議から離れていたユーザの名前を含んでいるかどうかに基づいて、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、ステップ４０５で受信された参加者をランク付けすることに基づいて、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、参加者と会議から離れていたユーザの間の関係に基づいて、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。会議から離れていたユーザとの参加者の関連性は、会議における参加者の役割（すなわち、会議の主催者、会議の要求された出席者、会議の任意選択的な出席者）、参加者の企業プロフィール、参加者の企業の階層、企業の階層内の参加者の地位、参加者の企業内の参加者の年功序列、参加者の主要な仕事、参加者の主要な仕事の職務を含むが、これらに限定されない、因子のセットによって決定される。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの技術に関する関心に基づいて、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、ユーザの専門知識に基づいて、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、過去の分析に基づいて、複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けするユーザ固有の要約プログラム１２２の例は、本明細書では個別の方法を使用して説明されるが、ユーザ固有の要約プログラム１２２が上記の実施形態の１つまたは複数の組合せを介して複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けしてよいということに、注意するべきである。 In embodiments, the user-specific summarization program 122 ranks a subset of the plurality of audio frames and/or video frames using an algorithmic method. In one or more embodiments, the user-specific summarization program 122 ranks a subset of the plurality of audio frames and/or video frames based on whether the audio frame and/or video frame contains the name of a user who was absent from the conference. In one or more embodiments, the user-specific summarization program 122 ranks a subset of the plurality of audio frames and/or video frames based on ranking the participants received in step 405. In one or more embodiments, the user-specific summarization program 122 ranks a subset of the plurality of audio frames and/or video frames based on the relationship between the participant and the user who was absent from the conference. A participant's relevance to the user who was away from the conference is determined by a set of factors including, but not limited to, the participant's role in the conference (i.e., conference organizer, conference required attendee, conference optional attendee), the participant's company profile, the participant's company hierarchy, the participant's position within the company hierarchy, the participant's seniority within the participant's company, the participant's primary job, and the participant's primary job function. In one or more embodiments, the user-specific summarization program 122 ranks a subset of the plurality of audio frames and/or video frames based on the user's technology interests. In one or more embodiments, the user-specific summarization program 122 ranks a subset of the plurality of audio frames and/or video frames based on the user's expertise. In one or more embodiments, the user-specific summarization program 122 ranks a subset of the plurality of audio frames and/or video frames based on past analysis. Although examples of the user-specific summarization program 122 ranking a subset of multiple audio and/or video frames are described herein using separate methods, it should be noted that the user-specific summarization program 122 may rank a subset of multiple audio and/or video frames via one or more combinations of the above embodiments.

実施形態では、ユーザ固有の要約プログラム１２２が複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットを選択することに応答して、ユーザ固有の要約プログラム１２２は、ステップ４２０で選択された複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けする。 In an embodiment, in response to the user-specific summarization program 122 selecting a subset of the plurality of audio frames and/or video frames, the user-specific summarization program 122 ranks the selected subset of the plurality of audio frames and/or video frames in step 420.

ステップ４３０で、ユーザ固有の要約プログラム１２２は統合された要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声フレームの統合された要約を準備する。別の実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズムで決定されたしきい値より上にランク付けされた複数のビデオ・フレームの統合された要約を準備する。別の実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声およびビデオ・フレームの統合された要約を準備する。 At step 430, the user-specific summarization program 122 prepares an integrated summary. In an embodiment, the user-specific summarization program 122 prepares an integrated summary of multiple audio frames ranked above an algorithmically determined threshold. In another embodiment, the user-specific summarization program 122 prepares an integrated summary of multiple video frames ranked above an algorithmically determined threshold. In another embodiment, the user-specific summarization program 122 prepares an integrated summary of multiple audio and video frames ranked above an algorithmically determined threshold.

実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声フレームまたはビデオ・フレームあるいはその両方をマージすることによって、統合された要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声フレームまたはビデオ・フレームあるいはその両方を連続的に一緒につなぎ合わせて、アルゴリズムで決定されたしきい値より上にランク付けされた複数の音声フレームまたはビデオ・フレームあるいはその両方をマージすることによって、統合された要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、アルゴリズムで決定されたしきい値より上にランク付けされなかった複数の音声フレームまたはビデオ・フレームあるいはその両方を取り消すことによって、統合された要約を準備する。しきい値は、動的であり、ユーザ固有の要約プログラム１２２のユーザごとに変わる。実施形態では、ユーザ固有の要約プログラム１２２は、トレーニング・データ・セットに対する機械学習に合わせて、アルゴリズムで決定されたしきい値を設定する。 In an embodiment, the user-specific summarization program 122 prepares the integrated summary by merging multiple audio and/or video frames ranked above an algorithmically determined threshold. In an embodiment, the user-specific summarization program 122 prepares the integrated summary by sequentially splicing together multiple audio and/or video frames ranked above an algorithmically determined threshold and merging multiple audio and/or video frames ranked above an algorithmically determined threshold. In an embodiment, the user-specific summarization program 122 prepares the integrated summary by discarding multiple audio and/or video frames that are not ranked above the algorithmically determined threshold. The threshold is dynamic and changes for each user of the user-specific summarization program 122. In an embodiment, the user-specific summarization program 122 sets the algorithmically determined threshold in accordance with machine learning on a training data set.

実施形態では、ユーザ固有の要約プログラム１２２が複数の音声フレームまたはビデオ・フレームあるいはその両方のサブセットをランク付けすることに応答して、ユーザ固有の要約プログラム１２２は、統合された要約を準備する。 In an embodiment, in response to the user-specific summarization program 122 ranking a subset of the plurality of audio frames and/or video frames, the user-specific summarization program 122 prepares an integrated summary.

第１の例では、仮想コラボレーション・サーバ・ツール上のユーザＡのオンライン仮想クラスが、教師Ｘによってホストされる。招待講演者Ｙも、ユーザＡのオンライン仮想クラスに参加する。ユーザＡのネットワーク接続の問題のために、ユーザＡがオンライン仮想クラスから切り離されている間に、教師Ｘは、２５枚のＭｉｃｒｏｓｏｆｔ（Ｒ）ＰｏｗｅｒＰｏｉｎｔスライドを提示する。教師Ｘが提示する最初のスライドは、スライド１である。しかし、教師Ｘは、招待講演者Ｙが後でクラスで提示する内容を説明するために、非常に簡単にスライド２５を参照する。次に教師Ｘは、ユーザＡがユーザＡのオンライン仮想クラスに再接続する前に、スライド２～２４に戻って参照し、説明する。この場合、ユーザ固有の要約プログラム１２２は、スライド１および２５が高度なコンテンツ情報を含んでいるため、スライド１および２５をスライド２～２４より高くランク付けする。スライド１および２５がマージされる。 In a first example, User A's online virtual class on a virtual collaboration server tool is hosted by Teacher X. Invited speaker Y also joins User A's online virtual class. While User A is disconnected from the online virtual class due to User A's network connection issues, Teacher X presents 25 Microsoft® PowerPoint slides. The first slide Teacher X presents is slide 1. However, Teacher X very briefly refers to slide 25 to explain what Invited speaker Y will present later in the class. Teacher X then returns to and explains slides 2-24 before User A reconnects to User A's online virtual class. In this case, the user-specific summarization program 122 ranks slides 1 and 25 higher than slides 2-24 because slides 1 and 25 contain more content information. Slides 1 and 25 are merged.

第２の例では、仮想コラボレーション・サーバ・ツール上のユーザＢのチーム会議は、チーム・リーダーＸによってホストされる。ユーザＢがチーム会議から切り離されている間に、チーム・リーダーＸは、ビデオがあまり技術的でなく、より図解的であるため、技術的スライドよりも非常に良くチーム・メンバーの共感を呼んだビデオを提示する。この場合、ユーザ固有の要約プログラム１２２は、技術的スライドより高くビデオをランク付けする。ビデオおよび他の高くランク付けされたスライドが一緒にマージされるが、技術的スライドは取り消される。 In a second example, User B's team meeting on the virtual collaboration server tool is hosted by Team Leader X. While User B is disconnected from the team meeting, Team Leader X presents a video that resonates much better with team members than the technical slides because the video is less technical and more illustrative. In this case, the user-specific summarization program 122 ranks the video higher than the technical slides. The video and the other highly ranked slides are merged together, but the technical slides are canceled.

ステップ４３５で、ユーザ固有の要約プログラム１２２は、統合された要約を１つまたは複数の文に変換する。実施形態では、ユーザ固有の要約プログラム１２２は、統合された要約をテキスト形式で１つまたは複数の文に変換する。実施形態では、ユーザ固有の要約プログラム１２２は、抽出テキスト要約アルゴリズムを使用して、統合された要約を１つまたは複数の文に変換する。実施形態では、ユーザ固有の要約プログラム１２２は、統合された要約内の重要な文および他の際立った情報を識別する。実施形態では、ユーザ固有の要約プログラム１２２は、統合された要約から重要な文および他の際立った情報を抽出する。会議の全持続時間の間、抽出テキスト要約アルゴリズムを使用する音声テキスト要約が発生するが、持続時間内の３０分のローリング・ウィンドウを開いた状態に保つ。実施形態では、ユーザ固有の要約プログラム１２２が統合された要約を準備することに応答して、ユーザ固有の要約プログラム１２２は、統合された要約を１つまたは複数の文に変換する。 In step 435, the user-specific summarization program 122 converts the integrated summary into one or more sentences. In an embodiment, the user-specific summarization program 122 converts the integrated summary into one or more sentences in text format. In an embodiment, the user-specific summarization program 122 converts the integrated summary into one or more sentences using an extractive text summarization algorithm. In an embodiment, the user-specific summarization program 122 identifies important sentences and other salient information in the integrated summary. In an embodiment, the user-specific summarization program 122 extracts important sentences and other salient information from the integrated summary. Speech-to-text summarization using the extractive text summarization algorithm occurs for the entire duration of the conference, but keeps a rolling 30-minute window open within the duration. In an embodiment, in response to the user-specific summarization program 122 preparing the integrated summary, the user-specific summarization program 122 converts the integrated summary into one or more sentences.

ステップ４４０で、ユーザ固有の要約プログラム１２２は話者ダイアライゼーションを適用する。話者ダイアライゼーションは、話者の識別情報に従って入力音声ストリームを同質のセグメントに分割するプロセスである。話者ダイアライゼーションは、音声ストリームを話者の順番に構造化することによって、および話者認識システムと共に使用された場合に、話者の真の識別情報を提供することによって、自動音声文字起こしの可読性を高める。話者ダイアライゼーションは、話者セグメント化および話者クラスタ化の組合せである。話者セグメント化は、音声ストリーム内の話者の変化点を検出することを目標とする。話者クラスタ化は、話者の特性に基づいて音声セグメントを一緒にグループ化することを目標とする。実施形態では、ユーザ固有の要約プログラム１２２は、音声フレーム内の話者の変化点を検出する（すなわち、話者セグメント化）。実施形態では、ユーザ固有の要約プログラム１２２は、話者の特性に基づいて音声セグメントを一緒にグループ化する（すなわち、話者クラスタ化）。実施形態では、ユーザ固有の要約プログラム１２２が統合された要約を１つまたは複数の文に変換することに応答して、ユーザ固有の要約プログラム１２２は、話者ダイアライゼーションを適用する。 In step 440, the user-specific summarization program 122 applies speaker diarization. Speaker diarization is the process of dividing the input audio stream into homogeneous segments according to speaker identities. Speaker diarization improves the readability of automatic speech transcription by structuring the audio stream into speaker order and, when used in conjunction with a speaker recognition system, by providing true speaker identities. Speaker diarization is a combination of speaker segmentation and speaker clustering. Speaker segmentation aims to detect speaker change points within an audio stream. Speaker clustering aims to group audio segments together based on speaker characteristics. In an embodiment, the user-specific summarization program 122 detects speaker change points within an audio frame (i.e., speaker segmentation). In an embodiment, the user-specific summarization program 122 groups audio segments together based on speaker characteristics (i.e., speaker clustering). In an embodiment, in response to the user-specific summarization program 122 converting the integrated summary into one or more sentences, the user-specific summarization program 122 applies speaker diarization.

ステップ４４５で、ユーザ固有の要約プログラム１２２は、１つまたは複数の文をランク付けする。実施形態では、ユーザ固有の要約プログラム１２２は、会議における話者の役割（すなわち、ホスト、参加者、出席者）、および会議の意図またはトピックあるいはその両方との文の関連性を含むが、これらに限定されない、因子のセットに基づいて、１つまたは複数の文をランク付けする。実施形態では、ユーザ固有の要約プログラム１２２は、上位Ｎ個の最も有用な文を保持する。Ｎは、ステップ２２０でユーザによって設定されたユーザの嗜好または機械駆動の推奨であってよい。実施形態では、ユーザ固有の要約プログラム１２２が話者ダイアライゼーションを適用することに応答して、ユーザ固有の要約プログラム１２２は、１つまたは複数の文をランク付けする。 In step 445, the user-specific summarization program 122 ranks the one or more sentences. In an embodiment, the user-specific summarization program 122 ranks the one or more sentences based on a set of factors, including, but not limited to, the speaker's role in the meeting (i.e., host, participant, attendee) and the relevance of the sentence to the meeting's intent and/or topic. In an embodiment, the user-specific summarization program 122 retains the top N most useful sentences, where N may be a user preference set by the user in step 220 or a machine-driven recommendation. In an embodiment, in response to the user-specific summarization program 122 applying speaker diarization, the user-specific summarization program 122 ranks the one or more sentences.

例えば、会議からのユーザの離脱の持続時間の間に、重要人物（例えば、会議のホスト）が１００個の文を話した場合、ユーザ固有の要約プログラム１２２は、重要性の順に１００個の文をすべてランク付けする。しかし、Ｎが２５に設定されたため、ユーザ固有の要約プログラム１２２は、話された上位２５個の文のみを保持する。 For example, if an important person (e.g., the conference host) spoke 100 sentences during the duration of the user's absence from the conference, the user-specific summarization program 122 would rank all 100 sentences in order of importance. However, because N was set to 25, the user-specific summarization program 122 would only retain the top 25 sentences spoken.

別の例では、会議からのユーザの離脱の持続時間の間に、重要人物（例えば、会議のホスト）が１００個の文を話した場合、ユーザ固有の要約プログラム１２２は、重要性の順に１００個の文をすべてランク付けする。１００個の文がすべて重要であるため、ユーザ固有の要約プログラム１２２に対して１００個の文をすべて保持することの機械駆動の推奨が行われ、そのためユーザ固有の要約プログラム１２２は、１００個の文をすべて保持する。 In another example, if an important person (e.g., the conference host) speaks 100 sentences during the duration of the user's absence from the conference, the user-specific summarization program 122 ranks all 100 sentences in order of importance. Because all 100 sentences are important, a machine-driven recommendation is made to the user-specific summarization program 122 to retain all 100 sentences, and so the user-specific summarization program 122 retains all 100 sentences.

さらに別の例では、重要人物（例えば、会議のホスト）が１つの文のみを話した場合、ユーザ固有の要約プログラム１２２は１つの文を保持する。 In yet another example, if an important person (e.g., the meeting host) speaks only one sentence, the user-specific summarization program 122 retains that one sentence.

ステップ４５０で、ユーザ固有の要約プログラム１２２は１つまたは複数のキーワードを識別する。実施形態では、ユーザ固有の要約プログラム１２２は、１つまたは複数の文内の１つまたは複数のキーワードを識別する。実施形態では、ユーザ固有の要約プログラム１２２は、会議の意図またはトピックあるいはその両方に関して１つまたは複数のキーワードを識別する。実施形態では、ユーザ固有の要約プログラム１２２は、会議の２人以上の参加者に関して１つまたは複数のキーワードを識別する。 In step 450, the user-specific summarization program 122 identifies one or more keywords. In an embodiment, the user-specific summarization program 122 identifies one or more keywords in one or more sentences. In an embodiment, the user-specific summarization program 122 identifies one or more keywords related to the intent and/or topic of the meeting. In an embodiment, the user-specific summarization program 122 identifies one or more keywords related to two or more participants of the meeting.

実施形態では、ユーザ固有の要約プログラム１２２は、重み付けを１つまたは複数のキーワードに割り当てる。実施形態では、ユーザ固有の要約プログラム１２２は、会議の意図またはトピックあるいはその両方とのキーワードの関連性に基づいて、重み付けを１つまたは複数のキーワードに割り当てる。実施形態では、ユーザ固有の要約プログラム１２２は、会議の２人以上の参加者とのキーワードの関連性に基づいて、重み付けを１つまたは複数のキーワードに割り当てる。 In an embodiment, the user-specific summarization program 122 assigns a weighting to one or more keywords. In an embodiment, the user-specific summarization program 122 assigns a weighting to one or more keywords based on the relevance of the keyword to the intent and/or topic of the meeting. In an embodiment, the user-specific summarization program 122 assigns a weighting to one or more keywords based on the relevance of the keyword to two or more participants of the meeting.

例えば、３つの文Ｓ１０、Ｓ１１、およびＳ１２は、会議の複数の参加者に関する候補である。特定の参加者と一致するキーワードの頻度に基づいてそのような文をランク付けするためのランク付けシステムが構築される。Ｓ１０、Ｓ１１、およびＳ１２からキーワードが識別される。識別されたキーワードは、「データ」、「アルゴリズム」、および「Ｋ平均クラスタ化」である。参加者Ｘ１０は、データ・サイエンティストである。参加者Ｘ１１は、ソリューション・アーキテクトである。Ｘ１０は、キーワード「データ」、「アルゴリズム」、および「Ｋ平均クラスタ化」を含む文により興味があり、一方、Ｘ１１は、キーワード「データ」を含む文のみに興味がある。 For example, three sentences S10, S11, and S12 are candidates for multiple participants in a meeting. A ranking system is constructed to rank such sentences based on the frequency of keywords that match specific participants. Keywords are identified from S10, S11, and S12. The identified keywords are "data," "algorithm," and "K-means clustering." Participant X10 is a data scientist. Participant X11 is a solution architect. X10 is more interested in sentences containing the keywords "data," "algorithm," and "K-means clustering," while X11 is only interested in sentences containing the keyword "data."

実施形態では、ユーザ固有の要約プログラム１２２は、１つまたは複数の文をタグ付けする。実施形態では、ユーザ固有の要約プログラム１２２は、タグ付けシステムを使用して１つまたは複数の文をタグ付けする。実施形態では、ユーザ固有の要約プログラム１２２は、１つまたは複数の文の意図を表す単語または語句を使用して１つまたは複数の文をタグ付けする。実施形態では、ユーザ固有の要約プログラム１２２が１つまたは複数の文をランク付けすることに応答して、ユーザ固有の要約プログラム１２２は１つまたは複数のキーワードを識別する。 In an embodiment, the user-specific summarization program 122 tags one or more sentences. In an embodiment, the user-specific summarization program 122 tags the one or more sentences using a tagging system. In an embodiment, the user-specific summarization program 122 tags the one or more sentences with words or phrases that express the intent of the one or more sentences. In an embodiment, in response to the user-specific summarization program 122 ranking the one or more sentences, the user-specific summarization program 122 identifies one or more keywords.

例えば、３つの文Ｓ１、Ｓ２、およびＳ３は、一群の人の意欲を引き出すか、または注目を集める目的で話される。タグ付けシステムは、これらの文を「動機付け」または「グループの注目を集める」としてタグ付けするために使用される。３つの文Ｓ４、Ｓ５、およびＳ６は、一群の人の気分を明るくする目的で話される。タグ付けシステムは、これらの文を「気分を明るくする」としてタグ付けするために使用される。３つの文Ｓ７、Ｓ８、およびＳ９は、何の目的もなく話される。タグ付けシステムは、これらの文を「無意味」としてタグ付けするために使用される。 For example, three sentences S1, S2, and S3 are spoken with the purpose of motivating or attracting the attention of a group of people. A tagging system is used to tag these sentences as "motivating" or "attention-grabbing". Three sentences S4, S5, and S6 are spoken with the purpose of lightening the mood of a group of people. A tagging system is used to tag these sentences as "lightening the mood". Three sentences S7, S8, and S9 are spoken without any purpose. A tagging system is used to tag these sentences as "nonsense".

ステップ４５５で、ユーザ固有の要約プログラム１２２はグローバルなランク付けを準備する。実施形態では、ユーザ固有の要約プログラム１２２は、ステップ４０５で識別されて重み付けを割り当てられた２人以上の参加者、ステップ４５０で識別されて重み付けを割り当てられた１つまたは複数のキーワード、およびステップ４５５でランク付けされた１つまたは複数の文のサブセットを組み込む、グローバルなランク付けを準備する。実施形態では、ユーザ固有の要約プログラム１２２は、ユーザに合わせてより調整された要約を準備することにおいてユーザ固有の要約プログラム１２２を支援するために、会議から離れたユーザがランク付けの優先順位をくつがえすことを可能にする。実施形態では、ユーザ固有の要約プログラム１２２が１つまたは複数のキーワードを識別することに応答して、ユーザ固有の要約プログラム１２２はグローバルなランク付けを準備する。 In step 455, the user-specific summarization program 122 prepares a global ranking. In an embodiment, the user-specific summarization program 122 prepares a global ranking that incorporates the two or more participants identified and assigned weights in step 405, the one or more keywords identified and assigned weights in step 450, and the subset of one or more sentences ranked in step 455. In an embodiment, the user-specific summarization program 122 allows a user away from the conference to override the ranking priority to assist the user-specific summarization program 122 in preparing a summary that is more tailored to the user. In an embodiment, in response to the user-specific summarization program 122 identifying one or more keywords, the user-specific summarization program 122 prepares a global ranking.

ステップ４６０で、ユーザ固有の要約プログラム１２２は要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、機械学習アルゴリズムを使用して要約を準備する。実施形態では、ユーザ固有の要約プログラム１２２は、ステップ４０５で重み付けを割り当てられた参加者ごとに要約を準備する。１つまたは複数の実施形態では、ユーザ固有の要約プログラム１２２は、上位Ｎ個のランク付けされた参加者に関する要約を準備する。Ｎは、ステップ２２０でユーザによって設定されたユーザの嗜好または機械駆動の推奨であってよい。 In step 460, the user-specific summarization program 122 prepares summaries. In embodiments, the user-specific summarization program 122 prepares summaries using a machine learning algorithm. In embodiments, the user-specific summarization program 122 prepares summaries for each participant assigned a weight in step 405. In one or more embodiments, the user-specific summarization program 122 prepares summaries for the top N ranked participants, where N may be a user preference or machine-driven recommendation set by the user in step 220.

例えば、１５人（すなわち、Ｘ１、Ｘ２、．．．、Ｘ１５）が会議に参加する。ユーザ固有の要約プログラム１２２は、会議における参加者の役割、参加者の企業における参加者の役割、および会議から離れたユーザとの参加者の関連性に基づいて、重み付けを各参加者に割り当てる。ユーザ固有の要約プログラム１２２は、上位５人のランク付けされた参加者（すなわち、Ｘ１、Ｘ２、Ｘ３、Ｘ４、およびＸ５）に関する要約を準備する。 For example, 15 people (i.e., X1, X2, ..., X15) participate in a conference. The user-specific summarization program 122 assigns a weighting to each participant based on the participant's role in the conference, the participant's role in the participant's enterprise, and the participant's relevance to the user away from the conference. The user-specific summarization program 122 prepares a summary for the top five ranked participants (i.e., X1, X2, X3, X4, and X5).

実施形態では、ユーザ固有の要約プログラム１２２は、会議から離れたユーザのプロフィールに合わせて、準備される各要約を調整する。実施形態では、ユーザ固有の要約プログラム１２２は、ステップ４５５で準備されたグローバルなランク付けを使用して各要約を調整する。実施形態では、ユーザ固有の要約プログラム１２２は、会議中に話した参加者と会議から離れたユーザの間の関係に比例して一致するように各要約を調整する。会議から離れたユーザとの参加者の関連性が、要因のセットによって決定される。要因のセットは、会議における参加者の役割（すなわち、会議の主催者、会議の要求された出席者、会議の任意選択的な出席者）、参加者の企業プロフィール、参加者の企業の階層、企業の階層内の参加者の地位、参加者の企業内の参加者の年功序列、参加者の主要な仕事、参加者の主要な仕事の職務、ユーザの技術に関する関心、およびユーザの専門知識、参加者が話すことまたは質問に回答することが期待されたため、会議のホストによって参加者の名前が言及されたかどうか、ならびに参加者が会議中に話していたかどうかを含むが、これらに限定されない。 In an embodiment, the user-specific summarizing program 122 tailors each prepared summary to the profile of the user who left the conference. In an embodiment, the user-specific summarizing program 122 tailors each summary using the global ranking prepared in step 455. In an embodiment, the user-specific summarizing program 122 tailors each summary to proportionally match the relationship between the participant who spoke during the conference and the user who left the conference. The participant's relevance to the user who left the conference is determined by a set of factors, including, but not limited to, the participant's role in the conference (i.e., the conference organizer, the conference requested attendee, the conference optional attendee), the participant's company profile, the participant's company hierarchy, the participant's position within the company hierarchy, the participant's seniority within the participant's company, the participant's primary job, the participant's primary job function, the user's technology interests, and the user's expertise, whether the participant was mentioned by name by the conference host because the participant was expected to speak or answer questions, and whether the participant spoke during the conference.

実施形態では、ユーザ固有の要約プログラム１２２は、音声認識信頼度スコアまたは意図信頼度スコアあるいはその両方を使用して各要約を調整する。実施形態では、ユーザ固有の要約プログラム１２２がグローバルなランク付けを準備することに応答して、ユーザ固有の要約プログラム１２２は要約を準備する。 In an embodiment, the user-specific summarization program 122 adjusts each summary using the speech recognition confidence score, the intent confidence score, or both. In an embodiment, the user-specific summarization program 122 prepares a summary in response to the user-specific summarization program 122 preparing a global ranking.

図５は、本発明の実施形態に従う、図１の分散データ処理環境１００内のコンピューティング・デバイス５００のコンポーネントのブロック図である。図５は、単に１つの実装の例を提供しており、さまざまな実施形態が実装され得る環境に関して、どのような制限も意味していないと理解されるべきである。図に示された環境に対して、多くの変更が行われ得る。 Figure 5 is a block diagram of components of a computing device 500 within the distributed data processing environment 100 of Figure 1 in accordance with an embodiment of the present invention. Figure 5 provides merely an example of one implementation and is not intended to imply any limitations with regard to the environments in which various embodiments may be implemented. Many modifications to the illustrated environment may be made.

コンピューティング・デバイス５００は、キャッシュ５１６、メモリ５０６、永続的ストレージ５０８、通信ユニット５１０、および入出力（Ｉ／Ｏ：input/output）インターフェイス５１２の間の通信を提供する通信ファブリック５０２を含んでいる。通信ファブリック５０２は、プロセッサ（マイクロプロセッサ、通信プロセッサ、およびネットワーク・プロセッサなど）、システム・メモリ、周辺機器、およびシステム内の任意の他のハードウェア・コンポーネントの間で、データまたは制御情報あるいはその両方を渡すために設計された、任意のアーキテクチャを使用して実装され得る。例えば、通信ファブリック５０２は、１つまたは複数のバスまたはクロスバ・スイッチを使用して実装され得る。 Computing device 500 includes a communications fabric 502 that provides communication between cache 516, memory 506, persistent storage 508, communications unit 510, and input/output (I/O) interface 512. Communications fabric 502 may be implemented using any architecture designed to pass data and/or control information between processors (such as microprocessors, communications processors, and network processors), system memory, peripherals, and any other hardware components in the system. For example, communications fabric 502 may be implemented using one or more buses or crossbar switches.

メモリ５０６および永続的ストレージ５０８は、コンピュータ可読ストレージ媒体である。この実施形態では、メモリ５０６はランダム・アクセス・メモリ（ＲＡＭ：random access memory）を含んでいる。一般に、メモリ５０６は、任意の適切な揮発性または不揮発性のコンピュータ可読ストレージ媒体を含むことができる。キャッシュ５１６は、メモリ５０６から最近アクセスされたデータ、およびアクセスされたデータに近いデータを保持することによって、コンピュータ・プロセッサ５０４の性能を向上させる高速なメモリである。 Memory 506 and persistent storage 508 are computer-readable storage media. In this embodiment, memory 506 includes random access memory (RAM). In general, memory 506 may include any suitable volatile or non-volatile computer-readable storage medium. Cache 516 is high-speed memory that improves the performance of computer processor 504 by holding recently accessed data and data near the accessed data from memory 506.

キャッシュ５１６を介して各コンピュータ・プロセッサ５０４のうちの１つまたは複数によって実行するため、またはアクセスするため、あるいはその両方のために、プログラムが永続的ストレージ５０８およびメモリ５０６に格納されてよい。実施形態では、永続的ストレージ５０８は、磁気ハード・ディスク・ドライブを含んでいる。磁気ハード・ディスク・ドライブに対する代替または追加として、永続的ストレージ５０８は、半導体ハード・ドライブ、半導体ストレージ・デバイス、読み取り専用メモリ（ＲＯＭ：read-only memory）、消去可能プログラマブル読み取り専用メモリ（ＥＰＲＯＭ：erasable programmable read-only memory）、フラッシュ・メモリ、あるいはプログラム命令またはデジタル情報を格納できる任意の他のコンピュータ可読ストレージ媒体を含むことができる。 Programs may be stored in persistent storage 508 and memory 506 for execution and/or access by one or more of the computer processors 504 via cache 516. In an embodiment, persistent storage 508 includes a magnetic hard disk drive. As an alternative or in addition to a magnetic hard disk drive, persistent storage 508 may include a solid-state hard drive, a solid-state storage device, read-only memory (ROM), erasable programmable read-only memory (EPROM), flash memory, or any other computer-readable storage medium capable of storing program instructions or digital information.

永続的ストレージ５０８によって使用される媒体は、取り外し可能であってもよい。例えば、取り外し可能ハード・ドライブが、永続的ストレージ５０８に使用されてよい。他の例としては、永続的ストレージ５０８の一部でもある別のコンピュータ可読ストレージ媒体に転送するためのドライブに挿入される、光ディスクおよび磁気ディスク、サム・ドライブ、ならびにスマート・カードが挙げられる。 The media used by persistent storage 508 may be removable. For example, a removable hard drive may be used for persistent storage 508. Other examples include optical and magnetic disks, thumb drives, and smart cards that are inserted into a drive for transfer to another computer-readable storage medium that is also part of persistent storage 508.

これらの例において、通信ユニット５１０は、他のデータ処理システムまたはデバイスとの通信を提供する。これらの例において、通信ユニット５１０は、１つまたは複数のネットワーク・インターフェイス・カードを含む。通信ユニット５１０は、物理的通信リンクおよびワイヤレス通信リンクのどちらかまたは両方を使用して通信を提供してよい。プログラムは、通信ユニット５１０を介して永続的ストレージ５０８にダウンロードされてよい。 In these examples, communications unit 510 provides for communication with other data processing systems or devices. In these examples, communications unit 510 includes one or more network interface cards. Communications unit 510 may provide communications using either or both physical and wireless communications links. Programs may be downloaded to persistent storage 508 via communications unit 510.

Ｉ／Ｏインターフェイス５１２は、サーバ１２０またはユーザ・コンピューティング・デバイス１３０あるいはその両方に接続されてよい他のデバイスとのデータの入力および出力を可能にする。例えば、Ｉ／Ｏインターフェイス５１２は、キーボード、キーパッド、タッチ・スクリーン、または他の適切な入力デバイス、あるいはその組合せなどの、外部デバイス５１８への接続を提供してよい。外部デバイス５１８は、例えばサム・ドライブ、ポータブル光ディスクまたはポータブル磁気ディスク、およびメモリ・カードなどの、ポータブル・コンピュータ可読ストレージ媒体を含むこともできる。本発明の実施形態を実践するために使用されるソフトウェアおよびデータは、そのようなポータブル・コンピュータ可読ストレージ媒体に格納することができ、Ｉ／Ｏインターフェイス５１２を介して永続的ストレージ５０８に読み込むことができる。Ｉ／Ｏインターフェイス５１２は、ディスプレイ５２０にも接続する。 I/O interface 512 allows for the input and output of data with other devices that may be connected to server 120 and/or user computing device 130. For example, I/O interface 512 may provide connection to external devices 518, such as a keyboard, keypad, touch screen, or other suitable input device, or a combination thereof. External devices 518 may also include portable computer-readable storage media, such as thumb drives, portable optical or magnetic disks, and memory cards. Software and data used to practice embodiments of the present invention may be stored on such portable computer-readable storage media and loaded into persistent storage 508 via I/O interface 512. I/O interface 512 also connects to display 520.

ディスプレイ５２０は、データをユーザに表示するためのメカニズムを提供し、例えば、コンピュータのモニタであってよい。 Display 520 provides a mechanism for displaying data to a user and may be, for example, a computer monitor.

本明細書に記載されたプログラムは、アプリケーションに基づいて識別され、本発明の特定の実施形態において、そのアプリケーションに関して実装される。ただし、本明細書における特定のプログラムの名前は単に便宜上使用されていると理解されるべきであり、したがって、本発明は、そのような名前によって識別されたか、または暗示されたか、あるいはその両方である特定のアプリケーションのみで使用するように制限されるべきではない。 The programs described herein are identified based on the application for which they are implemented in particular embodiments of the invention. However, it should be understood that the names of specific programs herein are used merely for convenience, and thus the invention should not be limited to use with only the particular application identified and/or implied by such names.

本発明は、システム、方法、またはコンピュータ・プログラム製品、あるいはその組合せであってよい。コンピュータ・プログラム製品は、プロセッサに本発明の態様を実行させるためのコンピュータ可読プログラム命令を含んでいるコンピュータ可読ストレージ媒体を含んでよい。 The present invention may be a system, a method, or a computer program product, or a combination thereof. The computer program product may include a computer-readable storage medium containing computer-readable program instructions for causing a processor to perform aspects of the present invention.

コンピュータ可読ストレージ媒体は、命令実行デバイスによって使用するための命令を保持および格納できる有形のデバイスであることができる。コンピュータ可読ストレージ媒体は、例えば、電子ストレージ・デバイス、磁気ストレージ・デバイス、光ストレージ・デバイス、電磁ストレージ・デバイス、半導体ストレージ・デバイス、またはこれらの任意の適切な組合せであってよいが、これらに限定されない。コンピュータ可読ストレージ媒体のさらに具体的な例の非網羅的リストは、ポータブル・フロッピー（Ｒ）・ディスク、ハード・ディスク、ランダム・アクセス・メモリ（ＲＡＭ）、読み取り専用メモリ（ＲＯＭ）、消去可能プログラマブル読み取り専用メモリ（ＥＰＲＯＭまたはフラッシュ・メモリ）、スタティック・ランダム・アクセス・メモリ（ＳＲＡＭ：static random access memory）、ポータブル・コンパクト・ディスク読み取り専用メモリ（ＣＤ－ＲＯＭ：compact disc read-only memory）、デジタル・バーサタイル・ディスク（ＤＶＤ：digital versatile disk）、メモリ・スティック、フロッピー（Ｒ）・ディスク、命令が記録されているパンチカードまたは溝の中の隆起構造などの機械的にエンコードされるデバイス、およびこれらの任意の適切な組合せを含む。本明細書において使用されるとき、コンピュータ可読ストレージ媒体は、電波または他の自由に伝搬する電磁波、導波管または他の送信媒体を伝搬する電磁波（例えば、光ファイバ・ケーブルを通過する光パルス）、あるいはワイヤを介して送信される電気信号などの、それ自体が一過性の信号であると解釈されるべきではない。 A computer-readable storage medium may be a tangible device capable of holding and storing instructions for use by an instruction execution device. A computer-readable storage medium may be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination thereof. A non-exhaustive list of more specific examples of computer-readable storage media includes portable floppy disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), static random access memory (SRAM), portable compact disc read-only memory (CD-ROM), digital versatile disk (DVD), memory stick, floppy disk, mechanically encoded devices such as punch cards or ridge-in-groove structures on which instructions are recorded, and any suitable combination thereof. As used herein, computer-readable storage media should not be construed as being ephemeral signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., light pulses passing through fiber optic cables), or electrical signals transmitted over wires.

本明細書に記載されたコンピュータ可読プログラム命令は、コンピュータ可読ストレージ媒体から各コンピューティング・デバイス／処理デバイスへ、またはネットワーク（例えば、インターネット、ローカル・エリア・ネットワーク、広域ネットワーク、またはワイヤレス・ネットワーク、あるいはその組合せ）を介して外部コンピュータまたは外部ストレージ・デバイスへダウンロードされ得る。このネットワークは、銅伝送ケーブル、光伝送ファイバ、ワイヤレス送信、ルータ、ファイアウォール、スイッチ、ゲートウェイ・コンピュータ、またはエッジ・サーバ、あるいはその組合せを備えてよい。各コンピューティング・デバイス／処理デバイス内のネットワーク・アダプタ・カードまたはネットワーク・インターフェイスは、コンピュータ可読プログラム命令をネットワークから受信し、それらのコンピュータ可読プログラム命令を各コンピューティング・デバイス／処理デバイス内のコンピュータ可読ストレージ媒体に格納するために転送する。 The computer-readable program instructions described herein may be downloaded from a computer-readable storage medium to each computing/processing device or to an external computer or external storage device via a network (e.g., the Internet, a local area network, a wide area network, or a wireless network, or a combination thereof). This network may include copper transmission cables, optical fiber transmissions, wireless transmissions, routers, firewalls, switches, gateway computers, and/or edge servers. A network adapter card or network interface within each computing/processing device receives the computer-readable program instructions from the network and forwards the computer-readable program instructions for storage on a computer-readable storage medium within each computing/processing device.

本発明の動作を実行するためのコンピュータ可読プログラム命令は、アセンブラ命令、命令セット・アーキテクチャ（ＩＳＡ：instruction-set-architecture）命令、マシン命令、マシン依存命令、マイクロコード、ファームウェア命令、状態設定データ、あるいはＳｍａｌｌｔａｌｋ（Ｒ）、Ｃ＋＋などのオブジェクト指向プログラミング言語、および「Ｃ」プログラミング言語または同様のプログラミング言語などの従来の手続き型プログラミング言語を含む１つまたは複数のプログラミング言語の任意の組合せで記述されたソース・コードまたはオブジェクト・コードであってよい。コンピュータ可読プログラム命令は、ユーザのコンピュータ上で全体的に実行すること、ユーザのコンピュータ上でスタンドアロン・ソフトウェア・パッケージとして部分的に実行すること、ユーザのコンピュータ上およびリモート・コンピュータ上でそれぞれ部分的に実行すること、あるいはリモート・コンピュータ上またはサーバ上で全体的に実行することができる。後者のシナリオでは、リモート・コンピュータは、ローカル・エリア・ネットワークまたは広域ネットワークを含む任意の種類のネットワークを介してユーザのコンピュータに接続されてよく、または接続は、（例えば、インターネット・サービス・プロバイダを使用してインターネットを介して）外部コンピュータに対して行われてよい。一部の実施形態では、本発明の態様を実行するために、例えばプログラマブル・ロジック回路、フィールドプログラマブル・ゲート・アレイ（ＦＰＧＡ：field-programmable gate arrays）、またはプログラマブル・ロジック・アレイ（ＰＬＡ：programmable logic arrays）を含む電子回路は、コンピュータ可読プログラム命令の状態情報を利用することによってコンピュータ可読プログラム命令を実行し、電子回路をカスタマイズしてよい。 The computer-readable program instructions for carrying out the operations of the present invention may be source or object code written in any combination of one or more programming languages, including assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state-setting data, or object code written in any combination of one or more programming languages, including object-oriented programming languages such as Smalltalk®, C++, and conventional procedural programming languages such as the "C" programming language or similar programming languages. The computer-readable program instructions may execute entirely on the user's computer, partially on the user's computer as a standalone software package, partially on the user's computer and on a remote computer, or entirely on a remote computer or server. In the latter scenario, the remote computer may be connected to the user's computer via any type of network, including a local area network or a wide area network, or the connection may be to an external computer (e.g., via the Internet using an Internet Service Provider). In some embodiments, to carry out aspects of the present invention, electronic circuits including, for example, programmable logic circuits, field-programmable gate arrays (FPGAs), or programmable logic arrays (PLAs), may execute computer-readable program instructions to customize the electronic circuitry by utilizing state information in the computer-readable program instructions.

本発明の態様は、本明細書において、本発明の実施形態に従って、方法、装置（システム）、およびコンピュータ・プログラム製品のフローチャート図またはブロック図あるいはその両方を参照して説明される。フローチャート図またはブロック図あるいはその両方の各ブロック、ならびにフローチャート図またはブロック図あるいはその両方に含まれるブロックの組合せが、コンピュータ可読プログラム命令によって実装され得るということが理解されるであろう。 Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions.

これらのコンピュータ可読プログラム命令は、コンピュータまたは他のプログラム可能なデータ処理装置のプロセッサを介して実行される命令が、フローチャートまたはブロック図あるいはその両方の１つまたは複数のブロックに指定される機能／動作を実施する手段を作り出すべく、汎用コンピュータ、専用コンピュータ、または他のプログラム可能なデータ処理装置のプロセッサに提供されてマシンを作り出すものであってよい。これらのコンピュータ可読プログラム命令は、命令が格納されたコンピュータ可読ストレージ媒体がフローチャートまたはブロック図あるいはその両方の１つまたは複数のブロックに指定される機能／動作の態様を実施する命令を含んでいる製品を備えるように、コンピュータ可読ストレージ媒体に格納され、コンピュータ、プログラム可能なデータ処理装置、または他のデバイス、あるいはその組合せに特定の方式で機能するように指示できるものであってもよい。 These computer-readable program instructions may be provided to a processor of a general-purpose computer, special-purpose computer, or other programmable data processing apparatus to create a machine, such that the instructions, executed by the processor of the computer or other programmable data processing apparatus, create means for performing the functions/operations specified in one or more blocks of the flowcharts and/or block diagrams. These computer-readable program instructions may be stored on a computer-readable storage medium and capable of directing a computer, programmable data processing apparatus, or other device, or combination thereof, to function in a particular manner, such that the computer-readable storage medium on which the instructions are stored comprises an article of manufacture containing instructions for performing aspects of the functions/operations specified in one or more blocks of the flowcharts and/or block diagrams.

コンピュータ可読プログラム命令は、コンピュータ上、他のプログラム可能な装置上、または他のデバイス上で実行される命令が、フローチャートまたはブロック図あるいはその両方の１つまたは複数のブロックに指定される機能／動作を実施するように、コンピュータ、他のプログラム可能なデータ処理装置、または他のデバイスに読み込まれてもよく、それによって、一連の動作可能なステップを、コンピュータ上、他のプログラム可能な装置上、またはコンピュータ実装プロセスを生成する他のデバイス上で実行させる。 Computer-readable program instructions may be loaded into a computer, other programmable data processing apparatus, or other device such that the instructions, which execute on the computer, other programmable apparatus, or other device, perform the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams, thereby causing a series of operable steps to be performed on the computer, other programmable apparatus, or other device to produce a computer-implemented process.

図内のフローチャートおよびブロック図は、本発明のさまざまな実施形態に従って、システム、方法、およびコンピュータ・プログラム製品の可能な実装のアーキテクチャ、機能、および動作を示す。これに関連して、フローチャートまたはブロック図内の各ブロックは、規定された論理機能を実装するための１つまたは複数の実行可能な命令を備える、命令のモジュール、セグメント、または部分を表してよい。一部の代替の実装では、ブロックに示された機能は、図に示された順序とは異なる順序で発生してよい。例えば、連続して示された２つのブロックは、実際には、含まれている機能に応じて、実質的に同時に実行されるか、または場合によっては逆の順序で実行されてよい。ブロック図またはフローチャート図あるいはその両方の各ブロック、ならびにブロック図またはフローチャート図あるいはその両方に含まれるブロックの組合せは、規定された機能または動作を実行するか、または専用ハードウェアとコンピュータ命令の組合せを実行する専用ハードウェアベースのシステムによって実装され得るということにも注意する。 The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in a flowchart or block diagram may represent a module, segment, or portion of instructions, comprising one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions shown in the blocks may occur out of the order shown in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently or in the reverse order, depending on the functionality involved. It is also noted that each block in the block diagrams and/or flowchart diagrams, and combinations of blocks included in the block diagrams and/or flowchart diagrams, may be implemented by a special-purpose hardware-based system that performs the specified functions or operations or executes a combination of special-purpose hardware and computer instructions.

本発明のさまざまな実施形態の説明は、例示の目的で提示されているが、網羅的であることは意図されておらず、開示された実施形態に制限されない。本発明の範囲から逸脱することなく、多くの変更および変形が、当業者にとって明らかになるであろう。本明細書で使用された用語は、実施形態の原理、実際の適用、または市場で見られる技術を超える技術的改良を最も良く説明するため、または他の当業者が本明細書で開示された実施形態を理解できるようにするために選択されている。 The description of various embodiments of the present invention is presented for illustrative purposes, but is not intended to be exhaustive and is not limited to the disclosed embodiments. Many modifications and variations will become apparent to those skilled in the art without departing from the scope of the present invention. The terms used herein have been selected to best explain the principles, practical applications, or technical improvements of the embodiments beyond those found in the marketplace, or to enable others skilled in the art to understand the embodiments disclosed herein.

Claims

1. A computer-implemented method comprising:
detecting, by one or more processors, a user absent from a virtual conference including at least two participants for a preset period of time or for a preset percentage of the total allotted time of the pre-scheduled virtual conference;
retrieving, by one or more processors, from a database a first set of data regarding the user, a second set of data regarding the at least two participants of the virtual conference, and a third set of data regarding a relationship between the user and the at least two participants of the virtual conference;
preparing, by one or more processors, a summary tailored to the user's profile and covering the portion of the virtual meeting during which the user was disconnected;
detecting, by one or more processors, that the user reconnects to the virtual conference;
determining, by one or more processors, whether the user will review the summary before rejoining the virtual conference based on preset user preferences, decisions made by the user, or machine-driven recommendations;
prompting, by one or more processors, the user to review the summary using a set of default user preferences in response to the user determining that the user will review the summary before rejoining the virtual conference;
and outputting, by one or more processors, the summary to the user.

after outputting the summary to the user, comparing, by one or more processors, the summary with a playback of the complete recording of the virtual conference;
requesting, by one or more processors, feedback from the user;
receiving, by one or more processors, the feedback from the user;
improving, by one or more processors, the accuracy of the prepared future summaries using reinforcement learning;
2. The computer-implemented method of claim 1, further comprising: storing, by one or more processors, the feedback from the user in the database.

preparing the summary tailored to the profile of the user and covering the portion of the virtual meeting while the user was disconnected;
identifying, by one or more processors, the at least two participants of the virtual conference;
assigning, by one or more processors, weights to the at least two participants of the virtual conference based on a first set of factors;
and ranking, by one or more processors, the at least two participants of the virtual conference based on the weightings assigned to the at least two participants.

extracting, by one or more processors, a plurality of audio frames, a plurality of video frames, or a plurality of audio and video frames from the portion of the virtual conference during which the users were disconnected, after ranking the at least two participants of the virtual conference based on the weights assigned to the at least two participants;
4. The computer-implemented method of claim 3, further comprising: identifying, by one or more processors, a context of the plurality of audio frames, the plurality of video frames, or the plurality of audio and video frames to understand one or more topics of the virtual conference.

After identifying the context of the plurality of audio frames, the plurality of video frames, or the plurality of audio and video frames to understand the one or more topics of the virtual conference, selecting, by one or more processors, a subset of the plurality of audio frames, a subset of the plurality of video frames, or a subset of the plurality of audio and video frames based on whether the plurality of audio frames, the plurality of video frames, or the plurality of audio and video frames contribute to preparing a summary for a user who was away from the conference;
ranking, by one or more processors, the subset of the plurality of audio frames, the subset of the plurality of video frames, or the subset of the plurality of audio and video frames using an algorithmic method or a second set of factors;
merging, by one or more processors, the subset of the plurality of audio frames, the subset of the plurality of video frames, or the subset of the plurality of audio and video frames ranked above an algorithmically determined threshold;
5. The computer-implemented method of claim 4, further comprising preparing, by one or more processors, an integrated summary of the subset of the plurality of audio frames ranked above the algorithmically determined threshold, an integrated summary of the subset of the plurality of video frames ranked above the algorithmically determined threshold, and an integrated summary of the subset of the plurality of audio and video frames ranked above the algorithmically determined threshold.

after preparing an integrated summary of the subset of the plurality of audio frames ranked above an algorithmically determined threshold, an integrated summary of the subset of the plurality of video frames ranked above an algorithmically determined threshold, and an integrated summary of the subset of the plurality of audio and video frames ranked above an algorithmically determined threshold, converting, by one or more processors, the integrated summaries into one or more sentences using an extractive text summarization algorithm;
applying, by one or more processors, speaker diarization;
ranking, by one or more processors, the one or more sentences based on a third set of factors; and
and retaining, by one or more processors, a subset of the one or more sentences ranked above a second threshold based on second preset user preferences or second machine-driven recommendations.

identifying, by one or more processors, one or more keywords in the subset of one or more sentences after retaining the subset of one or more sentences ranked above the second threshold based on the second preset user preferences or the second machine-driven recommendations;
assigning, by one or more processors, a weight to the one or more keywords in the subset of the one or more sentences based on relevance of the one or more keywords to the one or more topics of the virtual conference and based on relevance of the one or more keywords to the at least two participants of the virtual conference;
and tagging, by one or more processors, the subset of the one or more sentences with the one or more keywords.

preparing, by one or more processors, a global ranking incorporating the at least two participants, the one or more keywords, and the subset of the one or more sentences after tagging the subset of the one or more sentences with the one or more keywords;
and preparing, by one or more processors, the summary tailored to the user's profile based on the global ranking.

The computer-implemented method of claim 1, wherein the first set of data about the user, the second set of data about the at least two participants of the virtual conference, and the third set of data about the relationships between the user and the at least two participants of the virtual conference collected from the database include a profile created by the user, a company profile of the user, a calendar of the user, one or more profiles created by the at least two participants of the virtual conference, the company profiles of the at least two participants of the virtual conference, the calendars of the at least two participants of the virtual conference, one or more presentation tools used by the at least two participants of the virtual conference, and data from one or more previous virtual conferences hosted, participated in, or attended by the user.

detecting that the user reconnects to the virtual conference;
capturing, by one or more processors, the time during the virtual conference when the user reconnects;
and calculating, by one or more processors, a total duration of time that the user was disconnected from the virtual conference.

The computer-implemented method of claim 1, wherein the set of default user preferences for reviewing the summary includes language preferences, viewing mode preferences, audio preferences, and subtitle preferences.

comparing the summary to the playback of the complete recording of the virtual meeting;
extracting, by one or more processors, a first set of one or more key entities from the summary and a second set of one or more key entities from the full recording of the virtual meeting;
3. The computer-implemented method of claim 2, further comprising: matching, by one or more processors, the first set of one or more key entities from the summary with a second set of one or more key entities from the full recording of the virtual meeting.

requesting feedback from the user;
3. The computer-implemented method of claim 2, further comprising: providing, by one or more processors, the user with three or more choices representing a satisfaction level of the user, the three or more choices comprising: dissatisfied, neutral, and satisfied.

The computer-implemented method of claim 3, wherein the first set of factors includes the roles of the at least two participants in the virtual conference, the roles of the at least two participants in an enterprise, and the relationship of the at least two participants to the user remote from the virtual conference.

The computer-implemented method of claim 5, wherein the algorithmically determined threshold is dynamic and tailored to the training data set.

The computer-implemented method of claim 5, wherein the second set of factors includes whether the frame includes the name of the user who has left the virtual conference, the ranking of the at least two participants, and the relationship of the at least two participants to the user who has left the virtual conference based on the user's technology interests, the user's expertise, or past analysis.

Applying speaker diarization
detecting, by one or more processors, changes between the at least two participants in the plurality of audio frames;
The computer-implemented method of claim 6 , further comprising: grouping, by one or more processors, segments of audio together based on characteristics of the at least two participants.

The computer-implemented method of claim 6, wherein the third set of factors includes roles of the at least two participants in the virtual meeting and relevance of the one or more statements to the one or more topics of the virtual meeting.

A computer program causing a computer to execute the method of any one of claims 1 to 18.

one or more processors;
one or more computer-readable storage media;
and program instructions collectively stored on said one or more computer-readable storage media for execution by at least one of said one or more processors , said stored program instructions including program instructions for performing the method of any one of claims 1 to 18.