PDFファイルを知識ソースとして使用する場合、ファイルをアップロードした後、以下のチャンク分割方式が適用されます:
デフォルトチャンク方式
この方式は、例えばPDFの中にある表に関して、改良された方式でPDFからコンテンツを抽出します。これは、抽出されたコンテンツを固定のトークン長に基づいてチャンクに分割します。
この方法は、[Search Extract Output Node]のTop Kを5に設定したものと組み合わせて使用することをお勧めします。
代替チャンク方式
この方式は、抽出とチャンキングの代替方式として使用できます。この方式は、元のPDFが二重改行を使用して論理的な段落に分割されている場合に効果的に機能します。この方法では、システムはセクションなどの構造を判別し、それぞれを1つ以上のチャンクに分割します。
この方式は、アップロードされたファイルのファイル名にcognigy.preset_uiolc_ls.pdf
が付加されている場合に適用されます。例えば、cognigy.preset_uiolc_ls.pdf
は、cognigy
が最初のファイル名、.preset_uiolc_ls
がAlternative Chunk Strategy
を適用するプリセット、.pdf
がファイル拡張子です。
テキスト処理中、システムはPDFから視覚的に複雑なヘッダやリストのような複合要素を省略し、そのようなテキストをチャンクに含めないことがあることにご留意ください。したがって、テキストはシンプルで、複雑な要素を含まないことが望ましいです。
例
CognigyブログからPDFで以下のテキストがあるとします:
備考
# Call Tracing: Unlock New Dimensions in VUX Optimization
In the burgeoning field of voice technology, optimizing Voice User Experience (VUX) is a sophisticated endeavor fraught with unique challenges.
VUX designers confront intricate variables from timing nuances and intonation accuracy to the unpredictability of human speech and environmental interference.
These factors often present hurdles that can transform an otherwise fluid dialogue into a disjointed exchange.
For enterprises aiming to scale and perfect their voice-enabled services, these are not mere technicalities but critical pivots on the customer experience journey.
Recognizing this complex landscape, at Cognigy, we’re excited to introduce the ultimate tool for developers and VUX designers in their quest for voice excellence: Call Tracing.
## Visualizing the Unseen: How Call Tracing Changes the Game
Traditionally, understanding and monitoring voice interaction requires painstaking analysis of transcriptions, user feedback, and basic audio playback.
Now, Cognigy’s Call Tracing feature offers an actionable visualization of all activities taking place during the call, such as speech recognition and connections, together with the call recording waveforms.
This powerful tool grants VUX designers an unparalleled view of the dynamics of voice interactions.
Event and soundwave visualization provides an analytical perspective that goes beyond the audio, giving professionals the ability to dissect and understand voice experiences at a granular level.
It brings critical elements to the forefront that might not be immediately evident in transcriptions or basic audio playback.
このcall-tracing-sample.pdfファイルを.pdf形式でダウンロードすることもできます。
デフォルトの場合
デフォルトチャンク方式の場合、システムはこのテキストを3つの等しいチャンクに分割します。
デフォルトチャンク分割
1.
Call Tracing: Unlock New
Dimensions in VUX
Optimization
In the burgeoning field of voice technology, optimizing Voice User Experience (VUX) is a
sophisticated endeavor fraught with unique challenges. VUX designers confront intricate
variables from timing nuances and intonation accuracy to the unpredictability of human
speech and environmental interference.
These factors often present hurdles that can transform an otherwise fluid dialogue into a
disjointed exchange. For enterprises aiming to scale and perfect their voice-enabled
services, these are not mere technicalities but critical pivots on the customer experience
journey.
Recognizing this complex landscape, at
2.
Cognigy, we’re excited to introduce the ultimate
tool for developers and VUX designers in their quest for voice excellence: Call Tracing.
Visualizing the Unseen: How Call Tracing Changes the
Game
Traditionally, understanding and monitoring voice interaction requires painstaking
analysis of transcriptions, user feedback, and basic audio playback. Now, Cognigy’s
Call Tracing feature offers an actionable visualization of all activities taking place during
the call, such as speech recognition and connections, together with the call recording
waveforms. This powerful tool grants VUX designers an unparalleled view of the
dynamics of voice interactions
3.
.
Event and soundwave visualization provides an analytical perspective that goes beyond
the audio, giving professionals the ability to dissect and understand voice experiences
at a granular level. It brings critical elements to the forefront that might not be
immediately evident in transcriptions or basic audio playback.
代替の場合
代替チャンク方式の場合、システムはこのテキストを5つの論理チャンクに分割します。
最初のチャンクでは、タイトルが複雑な要素としてフォーマットされているため、システムによってタイトルが省略されていますが、デフォルトの方式では同じタイトルが含まれていることにご注意ください。
代替チャンク分割
1.
In the burgeoning field of voice technology, optimizing Voice User Experience (VUX)
is a sophisticated endeavor fraught with unique challenges.
VUX designers confront intricate variables from timing nuances and intonation accuracy
to the unpredictability of human speech and environmental interference.
2.
These factors often present hurdles that can transform an otherwise fluid dialogue
into a disjointed exchange.
For enterprises aiming to scale and perfect their voice-enabled services,
these are not mere technicalities but critical pivots on the customer experience journey.
3.
Recognizing this complex landscape, at Cognigy,
we’re excited to introduce the ultimate tool for developers
and VUX designers in their quest for voice excellence: Call Tracing.
4.
Traditionally, understanding and monitoring voice interaction requires painstaking analysis of
transcriptions, user feedback, and basic audio playback.
Now, Cognigy’s Call Tracing feature offers an actionable visualization of all activities taking place during the call,
such as speech recognition and connections, together with the call recording waveforms.
This powerful tool grants VUX designers an unparalleled view of the dynamics of voice interactions.
5.
Event and soundwave visualization provides an analytical perspective that goes beyond the audio,
giving professionals the ability to dissect and understand voice experiences at a granular level.
It brings critical elements to the forefront that might not be
immediately evident in transcriptions or basic audio playback.