crewAI

mirror of https://github.com/crewAIInc/crewAI.git synced 2026-01-11 09:08:31 +00:00

Author	SHA1	Message	Date
Devin AI	36673f89e7	Fix google-generativeai embedder validation error (issue #3741) This commit fixes the validation error that occurred when using the google-generativeai embedder provider with a flat configuration format. Changes: 1. Made the 'config' field optional in GenerativeAiProviderSpec by adding 'total=False' and marking 'provider' as Required, consistent with other provider specs like VertexAIProviderSpec. 2. Added normalization in the Crew class to automatically convert flat embedder configs to nested format before validation. This allows users to use either format: - Flat: {'provider': 'google-generativeai', 'api_key': '...', 'model_name': '...'} - Nested: {'provider': 'google-generativeai', 'config': {'api_key': '...', 'model_name': '...'}} 3. Updated the embedder factory to support both flat and nested config formats by checking for the presence of 'config' key and extracting config fields accordingly. 4. Added comprehensive tests to verify both formats work correctly: - Test for flat config format (the issue reported in #3741) - Test for nested config format (recommended format) - Test for TypedDict validation Fixes #3741 Co-Authored-By: João <joao@crewai.com>	2025-10-20 19:26:08 +00:00
Greyson LaLonde	12fa7e2ff1	fix: rename watson to watsonx embedding provider and prefix env vars - prefix provider env vars with embeddings_ - rename watson → watsonx in providers - add deprecation warning and alias for legacy 'watson' key (to be removed in v1.0.0)	2025-09-26 10:57:18 -04:00
Greyson LaLonde	ce5ea9be6f	feat: add custom embedding types and migrate providers - introduce baseembeddingsprovider and helper for embedding functions - add core embedding types and migrate providers, factory, and storage modules - remove unused type aliases and fix pydantic schema error - update providers with env var support and related fixes	2025-09-25 18:28:39 -04:00
Greyson LaLonde	1dbe8aab52	fix: add batch_size support to prevent embedder token limit errors - add batch_size field to baseragconfig (default=100) - update chromadb/qdrant clients and factories to use batch_size - extract and filter batch_size from embedder config in knowledgestorage - fix large csv files exceeding embedder token limits (#3574) - remove unneeded conditional for type Co-authored-by: Vini Brasil <vini@hey.com>	2025-09-24 00:05:43 -04:00
Greyson LaLonde	4ac65eb0a6	fix: support nested config format for embedder configuration - support nested config format with embedderconfig typeddict - fix parsing for model/model_name compatibility - add validation, typing_extensions, and improved type hints - enhance embedding factory with env var injection and provider support - add tests for openai, azure, and all embedding providers - misc fixes: test file rename, updated mocking patterns	2025-09-23 11:57:46 -04:00
Greyson LaLonde	58413b663a	chore: fix ruff linting issues in rag module linting, list embedding handling, and test update	2025-09-22 13:06:22 -04:00
Greyson LaLonde	d4aa676195	feat: add configurable search parameters for RAG, knowledge, and memory (#3531) - Add limit and score_threshold to BaseRagConfig, propagate to clients - Update default search params in RAG storage, knowledge, and memory (limit=5, threshold=0.6) - Fix linting (ruff, mypy, PERF203) and refactor save logic - Update tests for new defaults and ChromaDB behavior	2025-09-18 16:58:03 -04:00
Greyson LaLonde	f28e78c5ba	refactor: unify rag storage with instance-specific client support (#3455) - ignore line length errors globally - migrate knowledge/memory and crew query_knowledge to `SearchResult` - remove legacy chromadb utils; fix empty metadata handling - restore openai as default embedding provider; support instance-specific clients - update and fix tests for `SearchResult` migration and rag changes	2025-09-17 14:46:54 -04:00
Greyson LaLonde	ec1eff02a8	fix: achieve parity between rag package and current impl (#3418) - Sanitize ChromaDB collection names and use original dir naming - Add persistent client with file locking to the ChromaDB factory - Add upsert support to the ChromaDB client - Suppress ChromaDB deprecation warnings for `model_fields` - Extract `suppress_logging` into shared `logger_utils` - Update tests to reflect upsert behavior - Docs: add additional note	2025-08-28 11:22:36 -04:00
Greyson LaLonde	4b4a119a9f	refactor: simplify rag client initialization (#3401) * Simplified Qdrant and ChromaDB client initialization * Refactored factory structure and updated tests accordingly	2025-08-26 08:54:51 -04:00
Greyson LaLonde	7ac482c7c9	feat: rag configuration with optional dependency support (#3394) ### RAG Config System * Added ChromaDB client creation via config with sensible defaults * Introduced optional imports and shared RAG config utilities/schema * Enabled embedding function support with ChromaDB provider integration * Refactored configs for immutability and stronger type safety * Removed unused code and expanded test coverage	2025-08-26 00:00:22 -04:00
Greyson LaLonde	2e4bd3f49d	feat: qdrant generic client (#3377) ### Qdrant Client * Add core client with collection, search, and document APIs (sync + async) * Refactor utilities, types, and vector params (default 384-dim) * Improve error handling with `ClientMethodMismatchError` * Add score normalization, async embeddings, and optional `qdrant-client` dep * Expand tests and type safety throughout	2025-08-25 16:02:25 -04:00
Greyson LaLonde	842bed4e9c	feat: chromadb generic client (#3374) Add ChromaDB client implementation with async support - Implement core collection operations (create, get_or_create, delete) - Add search functionality with cosine similarity scoring - Include both sync and async method variants - Add type safety with NamedTuples and TypeGuards - Extract utility functions to separate modules - Default to cosine distance metric for text similarity - Add comprehensive test coverage TODO: - l2, ip score calculations are not settled on	2025-08-21 18:18:46 -04:00

13 Commits
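
The flat-to-nested embedder config normalization described in commit 36673f89e7 can be illustrated roughly as follows. This is only a sketch based on the two formats quoted in the commit message, not the code from the crewAI repository; the function name `normalize_embedder_config` is hypothetical, and the assumption that every key other than 'provider' moves under 'config' is inferred from the examples.

```python
from typing import Any


def normalize_embedder_config(embedder: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical helper: convert a flat embedder config to the nested form.

    Flat:   {'provider': 'google-generativeai', 'api_key': '...', 'model_name': '...'}
    Nested: {'provider': 'google-generativeai', 'config': {'api_key': '...', 'model_name': '...'}}
    """
    if "config" in embedder:
        # Already nested: return a shallow copy unchanged.
        return dict(embedder)

    # Assumption: every key except 'provider' belongs under 'config'.
    config = {key: value for key, value in embedder.items() if key != "provider"}

    normalized: dict[str, Any] = {"provider": embedder["provider"]}
    if config:
        # 'config' stays optional, so omit it when there is nothing to nest.
        normalized["config"] = config
    return normalized


flat = {"provider": "google-generativeai", "api_key": "KEY", "model_name": "MODEL"}
nested = normalize_embedder_config(flat)
```

Both input formats end up in the nested shape, so downstream validation only has to handle one layout.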