#!/usr/bin/env python3
"""
Multi-process FTS (Full-Text Search) benchmark for LanceDB
Benchmarks FTS queries on the 600M row Reddit comments dataset.
Requires two text files:
- words.txt: List of words for FTS queries (one per line)
- subreddits.txt: List of subreddits for filters (one per line)
Each query uses exactly the specified number of words and filters.