--- title: Whitespace description: Tokenizes text by splitting on whitespace canonical: https://docs.paradedb.com/documentation/tokenizers/available-tokenizers/whitespace --- The whitespace tokenizer splits only on whitespace. It also [lowercases](/documentation/token-filters/lowercase) characters by default. ```sql CREATE INDEX search_idx ON mock_items USING bm25 (id, (description::pdb.whitespace)) WITH (key_field='id'); ``` To get a feel for this tokenizer, run the following command and replace the text with your own: ```sql SELECT 'Tokenize me!'::pdb.whitespace::text[]; ``` ```ini Expected Response text ---------------- {tokenize,me!} (1 row) ```