Added rest_server connector #2
base: main
Conversation
A decent pull request; however, there are some minor things to improve.
General Suggestions:
- It would be easier to review this if there were a small usage example. Tests would not hurt either, since they are easy to generate with an LLM.
- Custom errors (ChatRESTServerError or similar) instead of ValueError would be better, but that requires writing an exceptions module, which is quite easy to do (see the sketch after this list).
- A custom logger module with logging would not hurt either.
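A minimal sketch of what such an exceptions module might look like. The name ChatRESTServerError comes from the comment above; the subclass and its fields are assumptions, not the PR author's design:

```python
# exceptions.py -- illustrative sketch, not the PR's actual module


class ChatRESTServerError(Exception):
    """Base error for the rest_server connector."""


class ChatRESTServerResponseError(ChatRESTServerError):
    """Raised when the REST server returns an unexpected response."""

    def __init__(self, status_code: int, message: str) -> None:
        self.status_code = status_code
        super().__init__(f"REST server returned {status_code}: {message}")
```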
I agree about custom errors, but it is ValueError that similar langchain classes raise in these situations. I thought about tests, but they are not possible, because this is an open library and I can't use any internal LLM URL.
Logging tools for Runnables are built into langchain.
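As an aside, one way to test without any internal LLM URL is to stub the HTTP layer. A minimal, self-contained illustration with unittest.mock, assuming the connector ultimately calls requests.post (the response shape here follows the OpenAI-style chat completions convention and is an assumption):

```python
from unittest import mock

import requests


def test_chat_endpoint_stub():
    """Illustration: no real server is needed when requests.post is patched."""
    fake = mock.Mock(spec=requests.Response)
    fake.status_code = 200
    fake.json.return_value = {
        "choices": [{"message": {"role": "assistant", "content": "hi"}}]
    }
    with mock.patch("requests.post", return_value=fake):
        # Anything that calls requests.post inside this block receives the
        # fake; a connector instance would be exercised here instead of
        # this direct call.
        resp = requests.post("http://localhost/v1/chat/completions", json={})
        assert resp.json()["choices"][0]["message"]["content"] == "hi"
```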
Co-authored-by: Jerzy Kamiński <[email protected]>
```python
            messages)
        }
        response = requests.post(
            url=f'{self.base_url}/v1/chat/completions',
```
Some older models use /v1/completions with manual tokens. Also, some newer models offer more control over the prompt in raw /v1/completions mode, which could be extremely beneficial. I have no clue how to generalize that, but it's something to keep in mind, because some users may have a custom LLM that operates on custom tokens. There are also other modes, like tokenization, and so on.
I think that's what the LLM classes in langchain are for; the ChatBaseModel classes assume chat specifically, not custom tokens. Maybe it makes sense to let the developer choose the endpoint, but then I think the whole point of this class is lost: there are too many additional settings, and at that point I would choose to write my own class. But I don't know, it's debatable.
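For illustration only, a sketch of the debated option of letting the developer choose the endpoint. The class and field names here are hypothetical, not the PR's code:

```python
from typing import Any, Dict

import requests


class ChatRESTServerSketch:
    """Hypothetical variant with the endpoint path as a constructor argument."""

    def __init__(self, base_url: str,
                 endpoint_path: str = "/v1/chat/completions") -> None:
        self.base_url = base_url
        # Defaults to the chat endpoint; a user whose model needs raw
        # /v1/completions with manual tokens could pass that path instead.
        self.endpoint_path = endpoint_path

    def _post(self, payload: Dict[str, Any]) -> requests.Response:
        return requests.post(url=f"{self.base_url}{self.endpoint_path}",
                             json=payload)
```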
```python
        stop: Optional[List[str]] = None,
        **kwargs: Any,
    ) -> Dict[str, Any]:
        payload = {
```
max_tokens, temperature, and other parameters should be configurable by the user; consider adding those fields to this dictionary.
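A sketch of what that suggestion could look like: forward only the parameters the user actually set. The helper and its signature are hypothetical; the parameter names follow the OpenAI-style chat completions convention:

```python
from typing import Any, Dict, List, Optional


def build_payload(
    model: str,
    messages: List[Dict[str, str]],
    temperature: Optional[float] = None,
    max_tokens: Optional[int] = None,
    **kwargs: Any,
) -> Dict[str, Any]:
    """Assemble the request body, including only parameters the user set."""
    payload: Dict[str, Any] = {"model": model, "messages": messages}
    if temperature is not None:
        payload["temperature"] = temperature
    if max_tokens is not None:
        payload["max_tokens"] = max_tokens
    payload.update(kwargs)  # pass through any extra server-specific options
    return payload
```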
What we have on our server right now does not support configuring any parameters. When that becomes available, I will add and test it.
Langchain connector for a typical REST server