Package dev.langchain4j.data.document.splitter


package dev.langchain4j.data.document.splitter
  • Classes
    Class
    Description
    Splits the provided Document into characters and attempts to fit as many characters as possible into a single TextSegment, adhering to the limit set by maxSegmentSize.
    Splits the provided Document into lines and attempts to fit as many lines as possible into a single TextSegment, adhering to the limit set by maxSegmentSize.
    Splits the provided Document into paragraphs and attempts to fit as many paragraphs as possible into a single TextSegment, adhering to the limit set by maxSegmentSize.
    Splits the provided Document into parts using the provided regex and attempts to fit as many parts as possible into a single TextSegment, adhering to the limit set by maxSegmentSize.
    Splits the provided Document into sentences and attempts to fit as many sentences as possible into a single TextSegment, adhering to the limit set by maxSegmentSize.
    Splits the provided Document into words and attempts to fit as many words as possible into a single TextSegment, adhering to the limit set by maxSegmentSize.
     
    Base class for hierarchical document splitters.