Astro - Hacker News

6 comments

edinetdb 3 hours ago

This is great for analytical workloads. I work with financial time series data (Japanese company filings) and have been using BigQuery with in-memory caching for the hot path. Curious whether DuckDB extensions like this could replace the BQ dependency for smaller datasets — the cold start + query cost model of serverless warehouses can be painful for API-serving use cases.
jarpineh 18 hours ago

As an aside, there's now Lance data format support in DuckDB through their extension. It has Lance's vector search support available among other things:
https://github.com/lance-format/lance-duckdb/tree/main?tab=r...
I just noticed this, and your post, and haven't yet checked neither (sorry). I'm however doing some vector search benchmarking soon, with DuckDB's options alongside others. So your work caught my attention here.
wolfgangK a day ago

Nice ! My most pressing request for VSS would be efficient binary vectors : is this on the table ?
[-]
- cigrainger 21 hours ago
  
  I haven't given binary vectors a lot of thought, but I'm exploring RaBitQ[1].
  [1] https://arxiv.org/abs/2405.12497
swaminarayan 20 hours ago

Does your method work better than standard ANN when filters are very strict—and how does it affect speed vs accuracy?
esafak a day ago

Please upstream it.