Releases: chroma-core/chroma
0.5.15
Version: 0.5.15
Git ref: refs/tags/0.5.15
Build Date: 2024-10-16T08:58
PIP Package: chroma-0.5.15.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.15
DockerHub Image: chromadb/chroma:0.5.15
What's Changed
- [CHORE] Bump rust version and CI/CD to 1.81.0 by @rescrv in #2958
- [RELEASE] 0.5.15 by @itaismith in #2959
Full Changelog: 0.5.14...0.5.15
0.5.13
Version: 0.5.13
Git ref: refs/tags/0.5.13
Build Date: 2024-10-10T23:05
PIP Package: chroma-0.5.13.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.13
DockerHub Image: chromadb/chroma:0.5.13
What's Changed
- [ENH] Make block and sparse index caches use persistent type. by @rescrv in #2904
- [ENH] Use binary search for gt/gte/lt/lte by @Sicheng-Pan in #2894
- [ENH] Deployment guide updates by @itaismith in #2906
- [BUG]: Lint job - python setup failure by @tazarov in #2924
- [BUG]: Log purge correctness by @tazarov in #2923
- [CLN] Make delete return None by @itaismith in #2880
- [BUG] Remove callouts to discord production support in docs by @itaismith in #2929
- [RELEASE] 0.5.13 by @itaismith in #2930
Full Changelog: 0.5.12...0.5.13
0.5.12
Version: 0.5.12
Git ref: refs/tags/0.5.12
Build Date: 2024-10-08T21:42
PIP Package: chroma-0.5.12.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.12
DockerHub Image: chromadb/chroma:0.5.12
What's Changed
- [CHORE] Switch to debug_struct over f.write by @HammadB in #2867
- [CHORE] Cleanup some clippy errors and all warnings for rust/types by @rescrv in #2858
- [CHORE] Remove the need to pop from the disk cache during compaction by @rescrv in #2865
- [CHORE] clean up cache, distance, index, storage clippy warnings by @rescrv in #2860
- [CHORE] Make a ChromaError type for RuntimeTypeError by @rescrv in #2866
- [ENH] We own the
chroma
crate by @rescrv in #2870 - hosted in early access by @jeffchuber in #2874
- [CHORE] Clippy for more. by @rescrv in #2871
- [ENH]: 21x faster full text querying by @codetheweb in #2833
- [ENH] Implement
SignedRoaringBitmap
and update where/where_document AST by @Sicheng-Pan in #2846 - [ENH] Implement
MetadataProvider
,RoaringMetadataFilter
, and refactorMetadataFilteringOperator
by @Sicheng-Pan in #2847 - [CLN] Refactor
MergeMetadataResultsOperator
by @Sicheng-Pan in #2848 - [CLN] Cleanup codebase with the refactored metadata filtering pipeline by @Sicheng-Pan in #2849
- [CHORE] Finish clippy work. Enable it. Make docs not warn too. by @rescrv in #2879
- [ENH] Implement serde for SparseIndex by @HammadB in #2881
- [ENH] Change the semantics of
$ne
,$nin
, and$not_contains
for local Chroma by @Sicheng-Pan in #2875 - [CHORE] Cleanup clippy warnings from merges pre-clippy. by @rescrv in #2885
- [TST] Move clippy into the lint job by @HammadB in #2886
- [ENH]: add tenant, database, and collection IDs to trace when available by @codetheweb in #2889
- [TST] Move rust benchmark crate to test crate by @Sicheng-Pan in #2900
- [ENH] Disk and memory-backed cache with Foyer 0.10. by @rescrv in #2890
- [TST] Implement utility functions to instantiate blockfile provider by @Sicheng-Pan in #2901
- [TST] Bound hypothesis version for breaking changes by @Sicheng-Pan in #2907
- [TST] Implement utility functions to instantiate segment by @Sicheng-Pan in #2902
- [ENH] RateLimitEnforcer abstract class by @drewkim in #2898
- [ENH] Automatically set version release as the latest release on Github by @itaismith in #2908
- [TST] Benchmark metadata filtering by @Sicheng-Pan in #2903
- [TST] Enable tests for $ne, $nin, and $not_contains for hosted Chroma by @Sicheng-Pan in #2909
- [RELEASE] 0.5.12 by @itaismith in #2913
Full Changelog: 0.5.11...0.5.12
0.5.11
Version: 0.5.11
Git ref: refs/tags/0.5.11
Build Date: 2024-09-27T00:35
PIP Package: chroma-0.5.11.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.11
DockerHub Image: chromadb/chroma:0.5.11
What's Changed
- [CHORE] cleanup Debug of HnswIndexProvider by @rescrv in #2864
- [CHORE] Cleanup clippy warnings in rust/blockstore by @rescrv in #2859
- [PERF] Convert embeddings representation to numpy by @drewkim in #2803
- [RELEASE] 0.5.11 by @itaismith in #2869
Full Changelog: 0.5.10...0.5.11
0.5.10
Version: 0.5.10
Git ref: refs/tags/0.5.10
Build Date: 2024-09-27T00:28
PIP Package: chroma-0.5.10.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.10
DockerHub Image: chromadb/chroma:0.5.10
What's Changed
- [RELEASE] 0.5.10 by @itaismith in #2862
Full Changelog: 0.5.9...0.5.10
0.5.7
What's Changed
- [ENH] Add support for FE tolerations by @HammadB in #2560
- [ENH] Support FE replica count by @HammadB in #2561
- [ENH] fix types export for CJS consumers/NPM types badge by @codetheweb in #2563
- [ENH] CIP: Write-Ahead Log Pruning & Vacuuming by @codetheweb in #2498
- [BUG]: Pydantic 1.9+ compatibility by @tazarov in #2229
- [BUG] HNSW Rust params should have defaults by @HammadB in #2564
- [ENH] enable retries on all gRPC channels by @codetheweb in #2572
- [TST] add hypothesis collection id to logs by @sanketkedia in #2573
- [ENH] return
chroma-trace-id
header, include trace ID in thrown errors by @codetheweb in #2570 - [ENH] Connection pool FD leak v2 by @tazarov in #2014
- [ENH] Properly display memberlist in SysDB by @Ishiihara in #2565
- [CHORE] Add logs to memberlist by @HammadB in #2583
- [CHORE] Add more debugs to memberlist by @HammadB in #2584
- [PERF] Trace auth requests by @HammadB in #2589
- [ENH] simplify logic for when to persist index changes (re-apply with fix) by @codetheweb in #2545
- [ENH] add
.clean_log()
to Producers by @codetheweb in #2549 - [ENH] automatically clean WAL by @codetheweb in #2557
- [ENH] add vacuum CLI command by @codetheweb in #2519
- [BUG]: Pydantic 1.x Union Types by @tazarov in #2577
- [PERF] Add sysdb catalog traces by @HammadB in #2593
- [ENH] Add sysdb cpu/memory requests and limits to helm chart by @HammadB in #2596
- [BUG] Sysdb k8s resource typo by @HammadB in #2600
- [ENH] JS Client Refactor (Full Stack) by @AlabasterAxe in #2542
- [ENH] add tracing to gorm by @codetheweb in #2606
- [ENH] propogate nested error message by @codetheweb in #2608
- [BUG] fix nil dereference during error log by @codetheweb in #2609
- [ENH] prefetch APIs for Record segment and blockfile by @sanketkedia in #2547
- [ENH] Dispatch prefetch() as operator by @sanketkedia in #2562
- [ENH] Tag operators as IO or other + dispatcher schedules IO tasks on the main runtime by @sanketkedia in #2603
- [TST] Isolated Test Environments using testcontainers by @AlabasterAxe in #2607
- [ENH] multipart S3 file uploads by @codetheweb in #2590
- [CLN] Refactor rust codebase into many crates by @HammadB in #2618
- [ENH]: only use multipart upload if object size > part size by @codetheweb in #2622
- [CLN] Move distance, and index into their own crates by @HammadB in #2624
- [BUG]: fix nightly test workflow by @codetheweb in #2635
- [BLD]: fix JS dev release workflow by @codetheweb in #2636
- [BUG] fix multi collection log purge by @codetheweb in #2617
- [ENH]: don't allow a null collection ID on segments by @codetheweb in #2626
- [CHORE] Move pull log trace record count to structured field by @HammadB in #2642
- [PERF] Make pull logs an I/O operator so it runs on main runtime by @HammadB in #2641
- [BUG] S3 client not respecting timeout by @HammadB in #2643
- [ENH] gRPC segment requests should include the collection ID by @codetheweb in #2613
- [PERF]: bump compaction partition size to 10k by @codetheweb in #2647
- [ENH]: allow configuring replica count of compaction service by @codetheweb in #2650
- [BUG]: fix tracing for IO tasks by @codetheweb in #2652
- [ENH]: fix compactor traces, add additional info to trace by @codetheweb in #2653
- [ENH]: add record count to span when submitting embeddings by @codetheweb in #2654
- [ENH] fix embedding validation span pollution by @codetheweb in #2658
- [ENH]: lift frontend gRPC retry to application & trace by @codetheweb in #2659
- [ENH] Fetch blocks in parallel and make load_blocks take a slice instead by @HammadB in #2662
- [CLN] Only use distance avx/neon/sse if enabled by @HammadB in #2664
- [BLD]: fix nightly tests pod log upload by @codetheweb in #2670
- [ENH] Make Rust/C++ FFI error handling robust by @HammadB in #2667
- [ENH] Add timeouts to FE grpc clients by @HammadB in #2671
- [ENH]: allow using node selectors for pods instead of tolerations by @codetheweb in #2672
- [ENH] [PERF] add indices to metadata columns by @levand in #2623
- [BLD]: bump
time
dependency by @codetheweb in #2679 - [ENH]: skip brute force search when log is empty by @codetheweb in #2666
- [ENH]: allow configuring replicaCount and nodeSelector on log service by @codetheweb in #2678
- [ENH]: improve compaction traces by @codetheweb in #2680
- [ENH] Network Admission Control APIs by @sanketkedia in #2628
- [ENH] Construct and pass NAC by @sanketkedia in #2630
- [ENH] Block manager and HNSW provider consume Network Admission Control by @sanketkedia in #2631
- [ENH] NAC rate limits requests by @sanketkedia in #2632
- [ENH]: use
follows_from()
standard instead of manually adding parent attribute by @codetheweb in #2691 - [ENH]: cleanup HNSW temporary files after registering result by @codetheweb in #2646
- [PERF] Multipart s3 downloads passing through NAC by @sanketkedia in #2698
- [CLN] Reorganize delta module into seperate module and split out impls and [PERF] Refactor bf get_size to avoid nested loops by @HammadB in #2674
- [ENH] Evict other versions of hnsw index of the collection when another version is fetched by @sanketkedia in #2707
- [ENH]: add metric for total # of uncompacted log records by @codetheweb in #2696
- [ENH] Add gc for log service by @Ishiihara in #2634
- [BUG]: fix log service metrics service name by @codetheweb in #2713
- [PERF] Binary search get_prefix() calls in blockfile by @sanketkedia in #2717
- [ENH]: update JS CloudClient to accept
tenant
parameter by @codetheweb in #2726 - [ENH]: add config fields for max partition size & max compaction size in compactor by @codetheweb in #2732
- [PERF] Decrease compaction RAM usage and increase speed by @sanketkedia in #2729
- [PERF]: remove mutex around tokenizer by @codetheweb in #2735
- [PERF]: better locking of uncommitted tracking maps (decrease compaction time by 3x) by @codetheweb in #2736
- [ENH]: purge block cache after compaction by @codetheweb in #2741
- [PERF] Move write to blockfiles off main runtime by @sanketkedia in #2740
- [ENH] Synchronize post s3.get() operations in providers by @sanketkedia in #2712
- [ENH]: make test_sanity recall based by @codetheweb in #2742
- [PERF] Add include metadata to MetadataReader to avoid unncessary payload size by @HammadB in #2750
- [PERF]: speed up
get_target_block_id()
by @codetheweb in #2743 - [ENH]: skip querying
MetadataSegmentReader
for empty where clause by @codetheweb in https://githu...
0.5.5
Version: 0.5.5
Git ref: refs/tags/0.5.5
Build Date: 2024-07-23T01:01
PIP Package: chroma-0.5.5.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.5
DockerHub Image: chromadb/chroma:0.5.5
What's Changed
- [PERF] Better tracing for storage access by @HammadB in #2500
- [BUG] Fix cross version persistence tests after 0.5.4 release by @HammadB in #2501
- [BLD] reduce max number of examples on slow prop test preset by @codetheweb in #2504
- [PERF] Trace block read size by @HammadB in #2505
- [ENH] Binary search for blockfile get() by @sanketkedia in #2506
- [ENH] Debuggable test_sanity by @sanketkedia in #2507
- [PERF] Materialization traces by @HammadB in #2508
- [ENH] Add collection id as log entry in test_embeddings by @sanketkedia in #2472
- [BUG] fix issue when deleting->adding same ID before index is persisted by @codetheweb in #2512
- [BUG] fix persistent HNSW parameter migration by @codetheweb in #2511
- [ENH] updates & invalid operations should also trigger persisting of local HNSW by @codetheweb in #2499
- [BUG] fix invalid operation persist check by @codetheweb in #2520
- [ENH] delete collection during teardown of state machine by @codetheweb in #2521
- [ENH] Use foyer for block cache by @Ishiihara in #2431
- [ENH] Introduce stream abstraction and enable concurrency test for blockfile by @Ishiihara in #2454
- [ENH] Allow cache eviction for HNSW provider by @Ishiihara in #2484
- [BUG] CHROMA_SERVER_NOFILE cannot be overriden with env var in container by @tazarov in #2209
- [ENH] Use instrumented tracing for HNSW provider file read by @Ishiihara in #2525
- [BLD] Allow query/compactor to specify tolerations in k8s by @HammadB in #2527
- [RELEASE] Release helm 0.1.4 by @HammadB in #2530
- [CLN] Make LocalStorage use sync filesystem APIs by @Ishiihara in #2531
- [BUG] fix HNSW param defaults in new configuration logic & require batch_size < sync_threshold by @codetheweb in #2526
- [BUG]: fix missing record in query result when many records were deleted and pending persist by @codetheweb in #2532
- [CLN] Move max_block_size_bytes to block_manager_config by @Ishiihara in #2535
- [TST] Get ts-integration-test.sh script working locally by @AlabasterAxe in #2518
- [ENH] simplify logic for when to persist index changes by @codetheweb in #2539
- [ENH] Add an init method that eagerly reports errors with the tenant or DB by @AlabasterAxe in #2537
- [BUG] debug nightly tests timing out by @codetheweb in #2538
- Revert "[ENH] simplify logic for when to persist index changes" by @codetheweb in #2544
- [TST] Enable cache eviction for shuttle concurrency test by @Ishiihara in #2546
- [ENH] Add error trace in log service for GetAllCollectionInfoToCompact by @sanketkedia in #2548
- [BLD] log artifacts per python version by @sanketkedia in #2556
- [CHORE] bump hnswlib by @codetheweb in #2558
- [RELEASE] 0.5.5 by @codetheweb in #2559
Full Changelog: 0.5.4...0.5.5
0.5.4
Version: 0.5.4
Git ref: refs/tags/0.5.4
Build Date: 2024-07-11T02:31
PIP Package: chroma-0.5.4.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.4
DockerHub Image: chromadb/chroma:0.5.4
What's Changed
- [ENH] Timeout log client by @HammadB in #2369
- [ENH] Timeout for s3 storage by @HammadB in #2370
- [ENH] Add timeout for sysdb client. Also add interceptor by @HammadB in #2371
- [ENH] use httpx over requests by @codetheweb in #2336
- [BLD] Fix for JS package in GH by @tazarov in #2345
- [ENH] Enable test_embeddings by @sanketkedia in #2376
- [ENH] Emulate sqlite fts in brute force metadata filtering by @sanketkedia in #2366
- [ENH] handle panics in query service by @codetheweb in #2375
- [BUG]: Fix cross version compatibility 0.5.0 or lower with 0.5.1+ by @tazarov in #2378
- [DOC]: Fix a typo in comment section in chromadb/db/system.py by @imaffe in #2386
- [ENH] 1965 Split up embedding functions by @nablabits in #2034
- [ENH] Min compaction size by @HammadB in #2346
- [ENH] add panics to tracing by @codetheweb in #2382
- [BLD] deploy preview of docs in PRs by @codetheweb in #2391
- [BLD] remove old artifacts from bleeding release by @codetheweb in #2392
- Revert "[ENH] 1965 Split up embedding functions" by @atroyn in #2394
- [BUG] 1965 Split up embedding functions - Redux by @atroyn in #2395
- [DOC] fix: add github link and update discord invite link by @zacatac in #2387
- [BUG] debug proto not included in release by @codetheweb in #2411
- [BLD] slight speed up by caching built Rust dependencies by @codetheweb in #2413
- [ENH] openapi types for better codegen by @jeffchuber in #2416
- [BLD] use Depot by @codetheweb in #2414
- [CLN] Remove test file by @HammadB in #2423
- [TST] Property Test Generation Fixes by @HammadB in #2383
- [PERF] Use binary search in positional posting list by @HammadB in #2424
- [ENH] add image tags to db migration jobs by @codetheweb in #2425
- [ENH] Add sync point to test_filtering + fix issues by @sanketkedia in #2388
- [ENH] clean up component channels by @codetheweb in #2429
- [ENH] Add collection repr by @HammadB in #2433
- [BUG] Ensure arrow sizing is correct / 64 byte aligned by @HammadB in #2426
- [PERF] Use orjson in client by @HammadB in #2434
- [ENH] add (optional) request/reply semantics for component handlers by @codetheweb in #2428
- [ENH] Configurable block size by @HammadB in #2439
- [ENH] Fix issues + add sync point to test embeddings by @sanketkedia in #2397
- [BUG]: Compatibility for
included
for 0.5.0 and earlier by @tazarov in #2445 - [ENH] Read from hnsw cache by @HammadB in #2443
- [BUG] Log purge off by one by @HammadB in #2436
- [BUG] fix consumable join handle panicking after clone by @codetheweb in #2448
- [ENH] S3 Retry by @HammadB in #2457
- [ENH] Compaction error replies by @HammadB in #2456
- [BUG] Allow hnswlinb to single record persist test by @HammadB in #2449
- [BUG] debug docs search by @jeffchuber in #2460
- [ENH] Support metadata update to value of different type by @sanketkedia in #2463
- [BLD] Pin Jaeger version by @sanketkedia in #2465
- [BLD] Service logs as artifacts by @sanketkedia in #2464
- [CLN] Compactor should tolerate all errors or panic by @HammadB in #2467
- [CLN] Don't trace expected error case by @HammadB in #2468
- [BUG]: Thin client imports by @tazarov in #2466
- [CLN] Increasing timeouts && S3 storage error tracing by @HammadB in #2471
- [CLN] Make memberlist reset() not wait longer than it needs to by @HammadB in #2473
- [CLN] ServerAPI to use Collection model by @atroyn in #2300
- [ENH] Collection Configuration Storage by @atroyn in #2338
- [CLN] component recv message span name by @HammadB in #2475
- [BLD] run intensive distributed property tests nightly by @codetheweb in #2469
- [BUG] Make go sysdb return created flag. Respect created flag by @HammadB in #2476
- [TST] ML sleep by @HammadB in #2480
- [CHORE] Add tracing for task names, materialize logs tracing, and pull logs count by @HammadB in #2479
- [BLD] fix docs preview trigger by @codetheweb in #2481
- [ENH] catch panics in handlers by @codetheweb in #2442
- [BUG] Fix 3.12 cross version persist tests by building the wheel manu… by @HammadB in #2482
- [CHORE] Add tracing to blockfile gets by @HammadB in #2485
- [BUG] Fix add -> wait for compaction -> delete -> add case by @sanketkedia in #2474
- [CLN] LogRecord TypedDict attr name incorrect by @HammadB in #2486
- [CLN]: Replacing FastAPI HTTPException with custom ChromaAuthError in basic/token auth by @tazarov in #2477
- [ENH] catch panics in task operators by @codetheweb in #2450
- [RELEASE] Release 0.5.4. Pin hnswlib upgrade. Configuration b/w compat. by @HammadB in #2496
- [BUG][TST] Fix thin client package test by @HammadB in #2497
New Contributors
- @imaffe made their first contribution in #2386
- @nablabits made their first contribution in #2034
- @zacatac made their first contribution in #2387
Full Changelog: 0.5.3...0.5.4
0.5.3
Version: 0.5.3
Git ref: refs/tags/0.5.3
Build Date: 2024-06-18T19:57
PIP Package: chroma-0.5.3.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.3
DockerHub Image: chromadb/chroma:0.5.3
What's Changed
- [BUG] remove hypothesis from prod code path by @codetheweb in #2373
- [RELEASE] 0.5.3 by @codetheweb in #2374
Full Changelog: 0.5.2...0.5.3
0.5.2
Version: 0.5.2
Git ref: refs/tags/0.5.2
Build Date: 2024-06-18T07:07
PIP Package: chroma-0.5.2.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.2
DockerHub Image: chromadb/chroma:0.5.2
What's Changed
- [ENH] Get vectors orchestrator by @HammadB in #2348
- [DOC]: Fix missing OpenLIT reference in Integration sidenav by @patcher9 in #2339
- [DOC] clarify that the async client is always available by @codetheweb in #2365
- [BUG] Mat logs should not populate invalid Adds by @sanketkedia in #2363
- [CLN] Refactor log materializer to not need to pass in offset id explicitly for readers by @sanketkedia in #2354
- [ENH] Handle metadata deletes + fix bugs related to Updates/deletes in the metadata writer by @sanketkedia in #2344
- [RELEASE] 0.5.2 by @HammadB in #2368
Full Changelog: 0.5.1...0.5.2