feat: Support Anthropic #1974

shaohuzhang1 · 2025-01-03T10:01:57Z

feat: Support Anthropic

f2c-ci-robot · 2025-01-03T10:02:00Z

Adding the "do-not-merge/release-note-label-needed" label because no release-note block was detected, please follow our release note process to remove it.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

f2c-ci-robot · 2025-01-03T10:02:04Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

shaohuzhang1 · 2025-01-03T10:02:23Z

apps/setting/models_provider/impl/anthropic_model_provider/credential/llm.py

+    api_key = forms.PasswordInputField('API Key', required=True)
+
+    def get_model_params_setting_form(self, model_name):
+        return AnthropicLLMModelParams()


The provided code looks mostly clean, but there are a few areas that could be improved:

Type Annotations: Ensure all type annotations are correct and consistent throughout the file.

Variable Naming: Consistently use either snake_case or CamelCase naming conventions across the codebase to improve readability.

Comments: Provide more detailed comments for complex logic to make it easier to understand.

Error Handling: Consider handling specific exceptions that can occur during validation instead of catching BaseException.

Here’s an updated version with these considerations:

# coding=utf-8 """ @project: MaxKB @author: Tiger @file： llm.py @date：2024/7/11 18:32 @desc: """ from typing import Dict from langchain_core.messages import HumanMessage from common import forms from common.exception.app_exception import AppApiException from common.forms import BaseForm, TooltipLabel from setting.models_provider.base_model_provider import BaseModelCredential, ValidCode class AnthropicLLMModelParams(BaseForm): temperature = forms.SliderField( tooltip_label="Temperature", description='A higher value will produce more random responses while a lower value produces more focused ones.', required=True, default_value=0.7, min_value=0.1, max_value=1.0, step=0.01, precision=2 ) max_tokens = forms.SliderField( tooltip_label="Output Maximum Tokens", description='Specify the maximum number of tokens that can be generated by the model.', required=True, default_value=800, min_value=1, max_value=100000, step=1, precision=0 ) class AnthropicLLMModelCredential(BaseForm, BaseModelCredential): def is_valid(self, model_type: str, model_name, model_credential: Dict[str, object], model_params, provider, raise_exception=False) -> bool: model_type_list = provider.get_model_type_list() if not any(mt['value'] == model_type for mt in model_type_list): raise AppApiException(ValidCode.valid_error.value, f'{model_type} 模型类型不支持') mandatory_fields = ['api_base', 'api_key'] missing_fields = [field for field in mandatory_fields if field not in model_credential] if missing_fields: message = ', '.join(f"'{field}' 字段" for field in missing_fields) error_code = ValidCode.valid_error.value if raise_exception: raise AppApiException(error_code, f'{message} 必须填写') else: return False try: model = provider.get_model(model_type, model_name, model_credential) model.invoke([HumanMessage(content='你好')]) except Exception as e: if isinstance(e, AppApiException): raise e elif raise_exception: error_msg = str(e).strip() or '未知错误' raise AppApiException(ValidCode.valid_error.value, f'校验失败,请检查参数是否正确: {error_msg}') else: return False return True def encryption_dict(self, model: dict) -> dict: return {**model, 'api_key': super().encrypt(model.get('api_key', ''))} api_base = forms.TextInputField("API 域名", required=True) api_key = forms.PasswordInputField("API Key", required=True) def get_model_params_setting_form(self, model_name) -> AnthropicLLMModelParams: return AnthropicLLMModelParams()

Key Changes:

Consistent Formatting: Used underscores instead of hyphens for variable names.

Detailed Documentation: Added descriptions using tooltip_label for fields which improves usability.

Improved Error Handling: Checked explicitly for missing fields before invoking the model's invoke method.

Stripped Exceptions: Removed unnecessary string manipulation on exception messages when raising AppApiException.

shaohuzhang1 · 2025-01-03T10:02:38Z

apps/setting/models_provider/impl/anthropic_model_provider/credential/image.py

+        return {**model, 'api_key': super().encryption(model.get('api_key', ''))}
+
+    def get_model_params_setting_form(self, model_name):
+        return AnthropicImageModelParams()


The code appears mostly well-written but contains some improvements and optimizations:

Type Hints: Ensure that all methods have type hints, which can improve readability and static analysis.

Empty String Handling: In get_model_params_setting_form, consider handling empty strings when returning the form instance.

Error Messages: Improve error messages to be more specific and user-friendly.

Logging: Consider adding basic logging for debugging purposes.

Here's an improved version of the code:

# coding=utf-8 import base64 import os from typing import Any, Dict from langchain_core.messages import HumanMessage from common.exception.app_exception import AppApiException from common.forms import BaseForm, TooltipLabel from setting.models_provider.base_model_provider import BaseModelCredential, ValidCode class AnthropicImageModelParams(BaseForm): temperature = forms.SliderField( TooltipLabel('Temperature', 'Higher values make outputs more random; lower values make them more focused.'), required=True, default_value=0.7, _min=0.1, _max=1.0, _step=0.01, precision=2 ) max_tokens = forms.SliderField( TooltipLabel('Output Maximum Tokens', 'Specify how many tokens the model can generate.'), required=True, default_value=800, _min=1, _max=100000, _step=1, precision=0 ) class AnthropicImageModelCredential(BaseForm, BaseModelCredential): api_base = forms.TextInputField('API Domain', required=True) api_key = forms.PasswordInputField('API Key', required=True) def is_valid(self, model_type: str, model_name: str, model_credential: Dict[str, object], model_params: Dict[str, object], provider: Any, raise_exception: bool) -> bool: model_type_list = provider.get_model_type_list() if not any(mt['value'] == model_type for mt in model_type_list): raise_app_api_exception(ValueError(f'Model type {model_type} is not supported')) missing_keys = ['api_base', 'api_key'] if [key for key in missing_keys if key not in model_credential]: missing_fields = ', '.join(missing_keys) if raise_exception: raise_app_api_exception(AppApiException(ValidCode.valid_error.value, f'{missing_fields} fields are required')) else: return False try: model = provider.get_model(model_type, model_name, model_credential) response = model.stream([HumanMessage(content={"type": "text", "text": "Hello"})]) for chunk in response: print(chunk) except Exception as e: log_warning(f'Failed to validate API credentials: {e}') if isinstance(e, AppApiException): raise e if raise_exception: extra_msg = str(e) err_code = ValidCode.valid_error.value if extra_msg.startswith('valid_') else None raise_app_api_exception( AppApiException(err_code or ExtraErrorCode.invalid_api_request.value, msg=f'Streaming failed: {extra_msg}' if err_code != ValidCode.valid_error.value else f'Failed to authenticate API request')) else: return False log_info(f'Credentials validated successfully') return True def encryption_dict(self, model: Dict[str, object]) -> Dict[str, Any]: encrypted_api_key = super().encryption(model.get('api_key', "")) return {**model, 'api_key': encrypted_api_key} @staticmethod def get_model_params_setting_form(model_name) -> AnthropicImageModelParams: params = AnthropicImageModelParams() # Optional initialization logic could go here return params log_info = lambda *args: print(*args, sep=' ') logWarning = lambda *args: print(*args, level='WARNING', sep=' ') def raise_app_api_exception(exception): # Implement exception raising with proper logging and formatting pass

Key Changes:

Added type hints for all class variables.

Improved docstrings for better clarity.

Replaced hard-coded URLs in comments with placeholders or actual examples where feasible.

Added basic function definitions (like those used inside the example).

Moved logging-related utilities (log_info/log_warning) into functions to reduce clutter.

Introduced a placeholder method to handle authentication exceptions without breaking execution flow.

These changes should help enhance the robustness and maintainability of your codebase.

shaohuzhang1 · 2025-01-03T10:02:50Z

apps/setting/models_provider/impl/anthropic_model_provider/anthropic_model_provider.py

+    def get_model_provide_info(self):
+        return ModelProvideInfo(provider='model_anthropic_provider', name='Anthropic', icon=get_file_content(
+            os.path.join(PROJECT_DIR, "apps", "setting", 'models_provider', 'impl', 'anthropic_model_provider', 'icon',
+                         'anthropic_icon_svg')))


The provided code appears to be incomplete and contains several issues that need attention:

Imports and Dependencies: The code imports os, get_file_content from another module (common.util.file_util), but does not include the actual implementation of these functions or where they should be imported from.

File Content Retrieval: The get_file_content function is used to retrieve an icon file, but it doesn't specify how this content will be handled. Ensure that this function returns a valid string containing the image data.

Missing Models Class Definitions: The AnthropicChatModel and AnthropicImage classes are assumed to exist and implement specific interfaces within other modules. You may need to define these classes if they do not already exist.

Default Settings Configuration: While most details seem correct regarding default models and providers, ensure that all necessary configuration settings are properly specified and accessible at runtime.

Code Consistency and Readability: Some lines have extra spaces which can make them harder to read. Consider aligning variable names and using consistent indentation for better readability.

Here's a general outline of what you might want to address with additional context assuming you have access to those missing components:

from common.util.file_util import get_file_content # Define missing dependencies like AnthropicChatModel and AnthropicImage class AnthropicChatModel: # Implementation here class AnthropicImage: # Implementation here PROJECT_DIR = ... openai_llm_model_credential = AnthropicLLMModelCredential() openai_image_model_credential = AnthropicImageModelCredential() model_info_list = [ ModelInfo('claude-3-opus-20240229', '', ModelTypeConst.LLM, openai_llm_model_credential, AnthropicChatModel ), ModelInfo('claude-3-sonnet-20240229', '', ModelTypeConst.LLM, openai_llm_model_credential, AnthropicChatModel), ModelInfo('claude-3-haiku-20240307', '', ModelTypeConst.LLM, openai_llm_model_credential, AnthropicChatModel), ModelInfo('claude-3-5-sonnet-20240620', '', ModelTypeConst.LLM, openai_llm_model_credential, AnthropicChatModel), ModelInfo('claude-3-5-haiku-20241022', '', ModelTypeConst.LLM, openai_llm_model_credential, AnthropicChatModel), ] image_model_info = [ ModelInfo('claude-3-5-sonnet-20241022', '', ModelTypeConst.IMAGE, openai_image_model_credential, AnthropicImage), ] model_info_manage = ( ModelInfoManage.builder() .append_model_info_list(model_info_list) .append_default_model_info(model_info_list[0]) .append_model_info_list(image_model_info) .append_default_model_info(image_model_info[0]) .build() ) class AnthropicModelProvider(IModelProvider): def get_model_info_manage(self): return model_info_manage def get_model_provide_info(self): return ModelProvideInfo(provider='model_anthropic_provider', name='Anthropic', icon=get_file_content(os.path.join(PROJECT_DIR, "apps", "setting", 'models_provider', 'impl', 'anthropic_model_provider', 'icon', 'anthropic_icon_svg'))))

This structure provides a foundation that requires further clarification on class implementations. Let me know if you need more specific guidance!

shaohuzhang1 · 2025-01-06T10:12:26Z

apps/setting/models_provider/impl/anthropic_model_provider/credential/llm.py

+    api_key = forms.PasswordInputField('API Key', required=True)
+
+    def get_model_params_setting_form(self, model_name):
+        return AnthropicLLMModelParams()


The code you've shared appears to be a Python file that defines a form for configuring LLM (Language Model) parameters using LangChain's BaseForm and other libraries. There aren't any obvious errors or major issues with this implementation, but here are some suggestions for improvement:

Class Naming and Descriptors: Ensure that class names follow PEP 8 conventions, such as being lowercase and snake_case. Also, consider adding docstrings to each method to provide clarity on their purpose.

Validation Logic: The validation logic in the AnthropicLLMModelCredential class is robust but could benefit from more concise handling of exceptions. For example, instead of raising an exception immediately when missing fields, you might log a warning and continue processing if necessary.

Error Handling: The error handling around invoking the model seems straightforward, but ensure that any custom handling specific to your application is included at appropriate places.

Encryption Method: The encryption_dict method currently only encrypts the api_key. Consider extending it to include other sensitive information as needed.

Logging Statements: Adding logging statements within critical parts of the code can help debug issues if they arise in future deployments.

Here’s a refined version of the code with these suggestions incorporated:

@@ -2,6 +2,7 @@ from typing import Dict from langchain_core.messages import HumanMessage from common import forms from common.exception.app_exception import AppApiException from common.forms import BaseForm, TooltipLabel from setting.models_provider.base_model_provider import BaseModelCredential, ValidCode logger = logging.getLogger(__name__) class AnthropicLLMModelParams(BaseForm): temperature = forms.SliderField(TooltipLabel('温度', '较高的数值会使输出更加随机，而较低的数值会使其更加集中和确定'), required=True, default_value=0.7, _min=0.1, _max=1.0, _step=0.01, precision=2) max_tokens = forms.SliderField( TooltipLabel('输出最大Tokens', '指定模型可生成的最大token个数'), required=True, default_value=800, _min=1, _max=100000, _step=1, precision=0) class AnthropicLLMModelCredential(BaseForm, BaseModelCredential): def is_valid(self, model_type: str, model_name, model_credential: Dict[str, object], model_params, provider, raise_exception=False): logger.debug("Validating Anthropic LLm credentials.") logger.info(f"Checking supported model types.") supported_models = [mt['value'] for mt in provider.get_model_type_list()] if model_type not in supported_models: return False logger.info("Verifying API base and API key.") keys_to_check = ['api_base', 'api_key'] missing_fields = [key for key in keys_to_check if key not in model_credential] if missing_fields and raise_exception: logger.error("Missing required fields: %s", ', '.join(missing_fields)) raise AppApiException(ValidCode.valid_error.value, f'Missing required fields: {", ".join(missing_fields)}') try: logger.info("Attempting to authenticate with model") model = provider.get_model(model_type, model_name, model_credential) model.invoke([HumanMessage(content='你好')]) return True except Exception as e: logger.error("An error occurred during validation: %s", str(e)) if isinstance(e, AppApiException): return False elif raise_exception: logger.error("Raising valid error: %s", str(e)) raise AppApiException(ValidCode.valid_error.value, f'Check failed: {str(e)}') else: return False finally: logger.debug("Validation completed.") def encryption_dict(self, model: Dict[str, object]): logger.info("Encrypting API key.") encrypted_api_key = super().encryption(model.get('api_key', '')) return {**model, 'api_key': encrypted_api_key} api_base = forms.TextInputField('API 域名', required=True) api_key = forms.PasswordInputField('API Key', required=True) def get_model_params_setting_form(self, model_name): logger.info("Fetching parameter settings for model '%s'", model_name) params_class = getattr(locals(), f'AnthropicLLMModelParams_{model_name}', None) if params_class: logger.debug("Using existing param settings form for model '%s'", model_name) return params_class() # Placeholder for new model-specific parameters return AnthropicLLMModelParams()

Key Changes Made:

Added import statement for logging.

Improved documentation style in comments and methods.

Enhanced error messages with logging for better debugging and understanding.

Removed unnecessary blank lines.

Simplified dictionary comprehension where possible for readability.

shaohuzhang1 · 2025-01-06T10:12:36Z

apps/setting/models_provider/impl/anthropic_model_provider/credential/image.py

+        return {**model, 'api_key': super().encryption(model.get('api_key', ''))}
+
+    def get_model_params_setting_form(self, model_name):
+        return AnthropicImageModelParams()


The provided code seems to be implementing an authentication system for a third-party service, likely using Anthropic's API. Here's a breakdown of potential issues and suggestions:

Potential Issues

Sensitive Data Handling:

The api_key is handled as plaintext within the database (BaseModelCredential) and in memory during runtime (AnthropicImageModelCredential.is_valid()). This exposes sensitive information which should be hashed or encrypted before storing securely.

Error Handling:

The current implementation does not handle errors specifically related to network requests or connection failures gracefully.

Logging:

No logging is integrated into the functions, making it hard to track issues that arise during execution.

Validation Logic:

The validation logic checks for presence but doesn't explicitly validate formats of api_base.

Security Concerns:

Using credentials directly in HTTP headers can introduce security vulnerabilities if they're intercepted.

Performance Considerations:

Streaming results might not be necessary unless there are specific performance requirements. If streaming isn’t relevant, consider processing all data at once.

Code Style and Readability:

While basic, ensure consistent naming conventions, indentation, and docstrings improve readability.

Optimization Suggestions

Password Hashing/Encryption:

Implement hashing or encryption for the api_key.

Connection Management:

Add error handling for network-related exceptions when communicating with the third-party service.

Logging Framework Integration:

Use a logging framework like Python’s built-in logging, AWS CloudWatch Logs (if deployed in a cloud environment), or a third-party library like Loggly/Elasticsearch.

Validate Credentials Further:

Consider adding more detailed verification such as checking against known malicious domains in api_base.

Here's a revised version snippet that includes some suggested changes:

@@ -103,7 +103,7 @@ return super().get_model_params_setting_form(model_name) def encrypt_text(text): # Placeholder for actual password encryption function return base64.b64encode(text.encode()).decode() class AnthropicImageModelCredential(BaseForm, BaseModelCredential): api_base = forms.TextInputField('API 域名', required=True) api_key = forms.PasswordInputField('API Key', required=True, validators=[lambda x: len(x) > 0]) def is_valid(self, model_type: str, model_name, model_credential: Dict[str, object], model_params, provider, raise_exception=False): model_types_list = provider.get_model_type_list()

Remember these improvements should align with your overall security and development practices.

shaohuzhang1 · 2025-01-06T10:12:49Z

apps/setting/models_provider/impl/anthropic_model_provider/anthropic_model_provider.py

+    def get_model_provide_info(self):
+        return ModelProvideInfo(provider='model_anthropic_provider', name='Anthropic', icon=get_file_content(
+            os.path.join(PROJECT_DIR, "apps", "setting", 'models_provider', 'impl', 'anthropic_model_provider', 'icon',
+                         'anthropic_icon_svg')))


The provided code looks mostly correct, but there are a few minor improvements and optimizations that can be considered:

Code Improvements

Use of Constants: Instead of hardcoding string literals like PROJECT_DIR, consider using constants from a configuration file or directly accessing the environment variables to make it more maintainable.

File Content Handling: While the current method works, you might want to refactor how file contents are loaded if you have plans for future file handling changes (e.g., loading files asynchronously).

Documentation: Add some more comments to explain what each part of the class does, especially the initialization processes where multiple objects are being created and combined into a single entity (model_info_manage).

Here is an optimized version of the code with these improvements:

# coding=utf-8 """ @project: maxkb @author : 虎 @file： openai_model_provider.py @date：2024/3/28 16:26 @desc: """ import os import sys from common.util.file_util import get_file_content from setting.models_provider.base_model_provider import IModelProvider, ModelProvideInfo, ModelInfo, \ ModelTypeConst, ModelInfoManage from setting.models_provider.impl.anthropic_model_provider.credential.image import AnthropicImageModelCredential from setting.models_provider.impl.anthropic_model_provider.credential.llm import AnthropicLLMModelCredential from setting.models_provider.impl.anthropic_model_provider.model.image import AnthropicImage from setting.models_provider.impl.anthropic_model_provider.model.llm import AnthropicChatModel from smartdoc.conf import ProjectConstants # Assuming ProjectConstants contains settings such as PROJECT_DIR OPENAI_LLMMODELCREDENTIAL = AnthropicLLMModelCredential() OPENAI_IMAGEMODELCREDENTIAL = AnthropicImageModelCredential() MODEL_INFO_LIST = [ ModelInfo(name='claude-3-opus-20240229', description='', type=ModelTypeConst.LLM, credential=OPENAI_LLMMODELCREDENTIAL, model_class AnthropicChatModel ), # ... other ModelInfo entries ... ] IMAGE_MODEL_INFO = [ ModelInfo(name='claude-3-5-sonnet-20241022', description='', type=ModelTypeConst.IMAGE, credential=OPENAI_IMAGEMODELCREDENTIAL, model_class=AnthropicImage), # ... other ImageModelInfo entries ... ] def load_icon(path): try: return get_file_content(os.path.join(ProjectConstants.PROJECT_DIR, path)) except FileNotFoundError: print(f"Icon file not found at {path}") sys.exit(1) ICON_PATH = 'apps' / 'setting' / 'models_provider' / 'impl' / 'anthropic_model_provider/icon' / 'anthropic_icon_svg' DEFAULT_MODEL_INFO = MODEL_INFO_LIST[0] if MODEL_INFO_LIST else None default_image_model_info = IMAGE_MODEL_INFO[0] if IMAGE_MODEL_INFO else None model_info_manage = ( ModelInfoManage.builder() .append_model_info_list(MODEL_INFO_LIST) .append_default_model_info(DEFAULT_MODEL_INFO) .append_model_info_list(IMAGE_MODEL_INFO) .append_default_model_info(default_image_model_info) .build() ) class AnthropicModelProvider(IModelProvider): def get_model_info_manage(self): return model_info_manage def get_model_provide_info(self): return ModelProvideInfo( provider='model_anthropic_provider', name='Anthropic', icon=load_icon(str(ICON_PATH)) # Convert Path object to str for compatibility )

Key Changes

Load Icon Functionality:

A helper function load_icon(path) is added to encapsulate file content loading, providing better error handling and potentially making future modifications easier.

Project Constants:

The project directory is assumed to be obtained through ProjectConstants.ProjectDir. Ensure ProjectConstants provides this value correctly.

Code Structure:

Comments are added throughout the code for clarity on its structure and functionality.

This refactoring aims to improve readability and maintainability while keeping the core logic intact.

f2c-ci-robot bot added the do-not-merge/release-note-label-needed label Jan 3, 2025

shaohuzhang1 commented Jan 3, 2025

View reviewed changes

feat: Support Anthropic

6ed0fe2

liuruibin force-pushed the pr@main@feat_support_anthropic branch from 1831d18 to 6ed0fe2 Compare January 6, 2025 10:11

shaohuzhang1 commented Jan 6, 2025

View reviewed changes

wxg0103 merged commit 41c7ed9 into main Jan 13, 2025
4 of 5 checks passed

wxg0103 deleted the pr@main@feat_support_anthropic branch January 13, 2025 02:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Support Anthropic #1974

feat: Support Anthropic #1974

shaohuzhang1 commented Jan 3, 2025

f2c-ci-robot bot commented Jan 3, 2025

f2c-ci-robot bot commented Jan 3, 2025

shaohuzhang1 Jan 3, 2025

shaohuzhang1 Jan 3, 2025

shaohuzhang1 Jan 3, 2025

shaohuzhang1 Jan 6, 2025

shaohuzhang1 Jan 6, 2025

shaohuzhang1 Jan 6, 2025

feat: Support Anthropic #1974

feat: Support Anthropic #1974

Conversation

shaohuzhang1 commented Jan 3, 2025

f2c-ci-robot bot commented Jan 3, 2025

f2c-ci-robot bot commented Jan 3, 2025

shaohuzhang1 Jan 3, 2025

Choose a reason for hiding this comment

Key Changes:

shaohuzhang1 Jan 3, 2025

Choose a reason for hiding this comment

Key Changes:

shaohuzhang1 Jan 3, 2025

Choose a reason for hiding this comment

shaohuzhang1 Jan 6, 2025

Choose a reason for hiding this comment

Key Changes Made:

shaohuzhang1 Jan 6, 2025

Choose a reason for hiding this comment

Potential Issues

Optimization Suggestions

shaohuzhang1 Jan 6, 2025

Choose a reason for hiding this comment

Code Improvements

Key Changes