- 12 Jan, 2026 11 commits
-
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
- Add create_embeddings_async() to support OpenAI batch embedding API - Refactor data_retrieval_tool to batch embed all queries in ONE request - Replace print() with logger.info() in product_search_helpers - Remove visual_search checks (only text search supported) Performance: 5-10x faster for multi-search queries (300ms vs 1.5s for 5 queries) Rate Limit: Saves RPM by batching multiple embeddings into single API call
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
- Dockerfile.dev: Development environment with hot reload & cache mount - Add Makefile targets: build-dev, run-dev for easy local testing - Hot reload watches agent, api, common directories - Includes health check endpoint Usage: make build-dev # Build dev image make run-dev # Run container with volume mount for hot reload
-
Vũ Hoàng Anh authored
- Both Dockerfile.stage and Dockerfile.prod now use --mount=type=cache - Faster builds with BuildKit cache mount (temporary, per-build) - Requires: docker buildx or DOCKER_BUILDKIT=1
-
Vũ Hoàng Anh authored
- Remove --no-cache-dir flag from pip install - Use Docker layer caching for faster production builds - Manual server rebuilds will auto-update packages if needed
-
Vũ Hoàng Anh authored
- Package not compatible with Python 3.11 - Remove to fix Docker build
-
Vũ Hoàng Anh authored
- StarRocksConnection: Use StarRocksConnectionManager singleton with lazy loading - Langfuse: Implement LangfuseClientManager singleton with lazy loading - EmbeddingService: Already using singleton pattern - Remove context managers (Langfuse auto-traces LangChain) - Fix imports across agent/tools and server - Clean up unnecessary comments and fix code organization
-
Vũ Hoàng Anh authored
-
- 08 Jan, 2026 1 commit
-
-
Vũ Hoàng Anh authored
- Removed OpenTelemetry (TracerProvider, OTLPSpanExporter, LoggingInstrumentor, FastAPIInstrumentor) - Implemented Langfuse v3.11.0 with CallbackHandler for LLM tracing - Added langfuse_trace_context() with propagate_attributes() for proper user_id filtering - Fixed user_id to appear in Langfuse User ID filter (not just metadata) - Added session_id and tags propagation for trace organization - Updated controller.py to wrap graph execution in langfuse_trace_context - Verified traces send to self-hosted Langfuse at http://172.16.2.207:3009 - Configuration: LANGFUSE_PUBLIC_KEY, LANGFUSE_SECRET_KEY, LANGFUSE_BASE_URL in .env GIAI_DOAN_1 COMPLETE: LLM observability via Langfuse SDK Next: GIAI_DOAN_2 - Container monitoring stack (cAdvisor + Prometheus + Grafana)
-
- 06 Jan, 2026 2 commits
-
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
- 29 Dec, 2025 10 commits
-
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
- 26 Dec, 2025 7 commits
-
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
- 25 Dec, 2025 2 commits
-
-
Vũ Hoàng Anh authored
-
Vũ Hoàng Anh authored
-
- 23 Dec, 2025 1 commit
-
-
Vũ Hoàng Anh authored
-