LiquidAI LFM2.5 MoE text generation through the OpenAI-compatible HTTP API
LiquidAI LFM2.5 MoE text generation through the OpenAI-compatible HTTP API.
Start the server: mistralrs serve -p 1234 -m LiquidAI/LFM2.5-8B-A1B
"""LiquidAI LFM2.5 MoE text generation through the OpenAI-compatible HTTP API.
Start the server: mistralrs serve -p 1234 -m LiquidAI/LFM2.5-8B-A1B"""
from openai import OpenAI
client = OpenAI(api_key="foobar", base_url="http://localhost:1234/v1/")
completion = client.chat.completions.create( model="default", messages=[ { "role": "user", "content": "Explain why sparse MoE models can be efficient in two short paragraphs.", } ], max_tokens=256, frequency_penalty=1.0, top_p=0.1, temperature=0.1,)print(completion.choices[0].message.content)Source: examples/server/lfm2_moe.py