open-webui集成x.ai新出的grok-2-image模型生图，150刀有去处啦

Jimmytinsley · 2025 年3 月 20 日 05:56

在佬友 @fl0w1nd 之前分享的fal.ai生图函数的基础上，花十分钟让cursor修改了一下，活比较糙，但是能用！

参考链接： open-webui 使用 fal.ai 绘图函数[Flux-Pro] - 开发调优 - LINUX DO

import aiohttp
import asyncio
import json
from typing import Optional
from pydantic import BaseModel, Field


async def emit(emitter, msg, done):
    await emitter(
        {
            "type": "status",
            "data": {
                "done": done,
                "description": msg,
            },
        }
    )


class Filter:
    class Valves(BaseModel):
        priority: int = Field(
            default=0,
            description="Priority level for the filter operations.",
        )
        api_key: str = Field(
            default="",
            description="Grok API 密钥",
        )
        api_base: str = Field(
            default="https://api.x.ai",
            description="Grok API 基础URL",
        )

    class UserValves(BaseModel):
        model: str = Field(
            default="grok-2-image",
            description="要使用的图像生成模型",
        )
        n: int = Field(
            default=1,
            description="要生成的图像数量，1-10之间",
        )
        response_format: str = Field(
            default="url",
            description="返回格式，可以是url或b64_json",
        )
        user: Optional[str] = Field(
            default=None,
            description="代表终端用户的唯一标识符，可帮助监控和检测滥用行为",
        )

    def __init__(self):
        self.valves = self.Valves()

    async def inlet(self, body, __user__, __event_emitter__):
        await emit(__event_emitter__, "正在准备图像生成请求，请等待...", False)
        return body

    async def request(self, prompt, __user__, __event_emitter__):
        url = f"{self.valves.api_base}/v1/images/generations"

        headers = {
            "Content-Type": "application/json",
            "Authorization": f"Bearer {self.valves.api_key}",
        }

        payload = {
            "prompt": prompt,
            "model": __user__["valves"].model,
            "n": __user__["valves"].n,
            "response_format": __user__["valves"].response_format,
        }

        if user := __user__["valves"].user:
            payload["user"] = user

        async with aiohttp.ClientSession() as sess:
            await emit(__event_emitter__, "正在发送图像生成请求...", False)

            try:
                response = await sess.post(url, json=payload, headers=headers)
                response_text = await response.text()

                if response.status != 200:
                    await emit(
                        __event_emitter__,
                        f"请求失败 ({response.status}): {response_text}",
                        True,
                    )
                    return []

                response_data = json.loads(response_text)

                if "data" not in response_data or not response_data["data"]:
                    await emit(__event_emitter__, "返回数据中不包含图像信息", True)
                    return []

                images = []
                for i, image_data in enumerate(response_data["data"]):
                    if "url" in image_data and image_data["url"]:
                        url = image_data["url"]
                        revised_prompt = image_data.get("revised_prompt", "")
                        images.append(
                            {
                                "image": f"![image{i}]({url})",
                                "revised_prompt": revised_prompt,
                            }
                        )
                    elif "b64_json" in image_data and image_data["b64_json"]:
                        # 这里可以处理base64编码的图像，但目前我们只是简单返回信息
                        revised_prompt = image_data.get("revised_prompt", "")
                        images.append(
                            {
                                "image": f"(Base64编码图像)",
                                "revised_prompt": revised_prompt,
                            }
                        )

                if images:
                    await emit(
                        __event_emitter__, f"图片生成成功，共{len(images)}张!", True
                    )
                    return images

                await emit(__event_emitter__, "未能获取到图像数据", True)
                return []

            except Exception as e:
                await emit(__event_emitter__, f"请求过程中发生错误: {str(e)}", True)
                return []

    async def outlet(self, body, __user__, __event_emitter__):
        await emit(__event_emitter__, f"正在生成图片，请等待...", False)
        last = body["messages"][-1]
        res = await self.request(last["content"], __user__, __event_emitter__)

        if res:
            # 将每个图像和其修改后的提示词添加到消息中
            for item in res:
                image = item["image"]
                revised_prompt = item.get("revised_prompt", "")

                # 添加图像和修改后的提示词（如果有）
                last["content"] = (
                    f"{image}\n\n修改后的提示词: {revised_prompt}\n\n{last['content']}"
                )

        return body

添加函数和模型的流程推荐参照原帖，大概步骤是：

管理员页面新建函数保存
函数页面点击齿轮，配置x.ai的apikey
工作台中新建模型，选择一个基础模型和prompt，过滤器选择刚刚新建的函数。模型的prompt可以自由发挥，也可以参考原帖中的。

碎碎念：grok-2-image的api生图似乎会自己revise prompt，没有关掉的地方，会自己添油加醋，导致出图与自己想象中有出入，速度和图像质量尚可。函数写得不足的地方，佬友也可以查看api文档自己修改

参考示例

XDX-pp · 2025 年3 月 20 日 06:00

不得不玩了

user45 · 2025 年3 月 20 日 06:00

这就试试

RDNE · 2025 年3 月 20 日 06:04

又有好玩的了

sniao · 2025 年3 月 20 日 06:06

感谢分享

coderZoe · 2025 年3 月 20 日 06:09

马斯克也太慢了，磨磨唧唧半天不出3的api

user_init · 2025 年3 月 20 日 06:18

大佬，按你的操作一步步实现了，但是不出图嘞

反倒是生成了提示词

OYO · 2025 年3 月 20 日 06:19

cherry studio不知道啥时候支持

user45 · 2025 年3 月 20 日 06:19

我的还是404不知道咋回事

Jimmytinsley · 2025 年3 月 20 日 06:20

apikey有正确设置吗？我刚开始忘记设置key了，好像也是只输出提示词

user_init · 2025 年3 月 20 日 06:26

设置了的

默认值我都给它设置了

xixia · 2025 年3 月 20 日 06:27

看看函数启用了没。。。我刚才没启用，和你这个一模一样

user_init · 2025 年3 月 20 日 06:31

还真是诶，哈哈哈哈，可以了，感谢佬

user1247 · 2025 年3 月 20 日 06:34

代理怎么挂上去

handsome · 2025 年3 月 20 日 06:42

我还是慢慢等3

Jimmytinsley · 2025 年3 月 20 日 06:57

软路由

coderZoe · 2025 年3 月 21 日 02:15

我也是只有提示词

函数开启了，甚至开启了全局，是我的模型设置的有问题吗

Jimmytinsley · 2025 年3 月 21 日 02:20

梯子相关也可以排查下哈，我是挂了软路由的

578382239 · 2025 年3 月 21 日 02:20

学习一下，感谢大佬

coderZoe · 2025 年3 月 21 日 02:36

梯子应该没问题，owu日志没看到网络异常，我用的docker环境变量http_proxy，请求oai都可以正常走梯子的

话题		回复	浏览量
在OpenWebUI中使用FLUX绘画（硅基流动）搞七捻三 AFF , 人工智能	48	1826	2025 年2 月 7 日
open-webui通过函数调用grok-2-image模型生成图片开发调优人工智能 , OpenWebUI	28	566	2025 年3 月 24 日
open-webui 使用 fal.ai 绘图函数[Flux-Pro] 开发调优人工智能 , OpenWebUI	13	786	2025 年3 月 23 日
🌋【Grok $150】NextChat 调用低审核的 Grok-2-Image 文生图。资源荟萃 AIGC , 人工智能	53	1355	2025 年3 月 28 日
【api-checker】纯前端API 检测工具 v1.3 快来测测你GPT API是否掺假资源荟萃 OpenAI , 人工智能	56	3234	2025 年3 月 18 日

open-webui集成x.ai新出的grok-2-image模型生图，150刀有去处啦

相关话题