dhlab.text.chunking

Module Contents

Classes

Chunks

Create chunks from a text.

API

class dhlab.text.chunking.Chunks(urn=None, chunks=1000)

Create chunks from a text.

Initialization

Parameters:
  • urn – str or list

  • chunks – {‘para’, ‘avsn’} or int

to_pandas()

Vectorize into a pandas dataframe with words a index