PDF

PDFファイルを知識ソースとして使用する場合、ファイルをアップロードした後、以下のチャンク分割方式が適用されます:

デフォルトチャンク方式

この方式は、例えばPDFの中にある表に関して、改良された方式でPDFからコンテンツを抽出します。これは、抽出されたコンテンツを固定のトークン長に基づいてチャンクに分割します。

この方法は、[Search Extract Output Node]Top Kを5に設定したものと組み合わせて使用することをお勧めします。

代替チャンク方式

この方式は、抽出とチャンキングの代替方式として使用できます。この方式は、元のPDFが二重改行を使用して論理的な段落に分割されている場合に効果的に機能します。この方法では、システムはセクションなどの構造を判別し、それぞれを1つ以上のチャンクに分割します。

この方式は、アップロードされたファイルのファイル名にcognigy.preset_uiolc_ls.pdfが付加されている場合に適用されます。例えば、cognigy.preset_uiolc_ls.pdfは、cognigyが最初のファイル名、.preset_uiolc_lsAlternative Chunk Strategyを適用するプリセット、.pdfがファイル拡張子です。

テキスト処理中、システムはPDFから視覚的に複雑なヘッダやリストのような複合要素を省略し、そのようなテキストをチャンクに含めないことがあることにご留意ください。したがって、テキストはシンプルで、複雑な要素を含まないことが望ましいです。

CognigyブログからPDFで以下のテキストがあるとします:

備考


# Call Tracing: Unlock New Dimensions in VUX Optimization

In the burgeoning field of voice technology, optimizing Voice User Experience (VUX) is a sophisticated endeavor fraught with unique challenges. 
VUX designers confront intricate variables from timing nuances and intonation accuracy to the unpredictability of human speech and environmental interference.
These factors often present hurdles that can transform an otherwise fluid dialogue into a disjointed exchange. 
For enterprises aiming to scale and perfect their voice-enabled services, these are not mere technicalities but critical pivots on the customer experience journey.
Recognizing this complex landscape, at Cognigy, we’re excited to introduce the ultimate tool for developers and VUX designers in their quest for voice excellence: Call Tracing.

## Visualizing the Unseen: How Call Tracing Changes the Game 

Traditionally, understanding and monitoring voice interaction requires painstaking analysis of transcriptions, user feedback, and basic audio playback. 
Now, Cognigy’s Call Tracing feature offers an actionable visualization of all activities taking place during the call, such as speech recognition and connections, together with the call recording waveforms. 
This powerful tool grants VUX designers an unparalleled view of the dynamics of voice interactions.
Event and soundwave visualization provides an analytical perspective that goes beyond the audio, giving professionals the ability to dissect and understand voice experiences at a granular level. 
It brings critical elements to the forefront that might not be immediately evident in transcriptions or basic audio playback.

このcall-tracing-sample.pdfファイルを.pdf形式でダウンロードすることもできます。

デフォルトの場合

デフォルトチャンク方式の場合、システムはこのテキストを3つの等しいチャンクに分割します。

デフォルトチャンク分割

1.

Call Tracing: Unlock New
  Dimensions in VUX
  Optimization
  In the burgeoning field of voice technology, optimizing Voice User Experience (VUX) is a
  sophisticated endeavor fraught with unique challenges. VUX designers confront intricate
  variables from timing nuances and intonation accuracy to the unpredictability of human
  speech and environmental interference.
  These factors often present hurdles that can transform an otherwise fluid dialogue into a
  disjointed exchange. For enterprises aiming to scale and perfect their voice-enabled
  services, these are not mere technicalities but critical pivots on the customer experience
  journey.
  Recognizing this complex landscape, at

2.

 Cognigy, we’re excited to introduce the ultimate
  tool for developers and VUX designers in their quest for voice excellence: Call Tracing.
  Visualizing the Unseen: How Call Tracing Changes the
  Game
  Traditionally, understanding and monitoring voice interaction requires painstaking
  analysis of transcriptions, user feedback, and basic audio playback. Now, Cognigy’s
  Call Tracing feature offers an actionable visualization of all activities taking place during
  the call, such as speech recognition and connections, together with the call recording
  waveforms. This powerful tool grants VUX designers an unparalleled view of the
  dynamics of voice interactions

3.

.
Event and soundwave visualization provides an analytical perspective that goes beyond
the audio, giving professionals the ability to dissect and understand voice experiences
at a granular level. It brings critical elements to the forefront that might not be
immediately evident in transcriptions or basic audio playback.

代替の場合

代替チャンク方式の場合、システムはこのテキストを5つの論理チャンクに分割します。

最初のチャンクでは、タイトルが複雑な要素としてフォーマットされているため、システムによってタイトルが省略されていますが、デフォルトの方式では同じタイトルが含まれていることにご注意ください。

代替チャンク分割

1.

In the burgeoning field of voice technology, optimizing Voice User Experience (VUX)
  is a sophisticated endeavor fraught with unique challenges. 
  VUX designers confront intricate variables from timing nuances and intonation accuracy 
  to the unpredictability of human speech and environmental interference.

2.

These factors often present hurdles that can transform an otherwise fluid dialogue
  into a disjointed exchange. 
  For enterprises aiming to scale and perfect their voice-enabled services, 
  these are not mere technicalities but critical pivots on the customer experience journey.

3.

Recognizing this complex landscape, at Cognigy, 
we’re excited to introduce the ultimate tool for developers
and VUX designers in their quest for voice excellence: Call Tracing.

4.

Traditionally, understanding and monitoring voice interaction requires painstaking analysis of 
transcriptions, user feedback, and basic audio playback. 
Now, Cognigy’s Call Tracing feature offers an actionable visualization of all activities taking place during the call,
such as speech recognition and connections, together with the call recording waveforms. 
This powerful tool grants VUX designers an unparalleled view of the dynamics of voice interactions.

5.

Event and soundwave visualization provides an analytical perspective that goes beyond the audio, 
giving professionals the ability to dissect and understand voice experiences at a granular level. 
It brings critical elements to the forefront that might not be 
immediately evident in transcriptions or basic audio playback.

詳細はこちら