NVIDIA api 支持 models 端点查询可用模型,以下是列表:
01-ai/yi-large
aisingapore/sea-lion-7b-instruct
baai/bge-m3
baichuan-inc/baichuan2-13b-chat
databricks/dbrx-instruct
google/codegemma-1.1-7b
google/codegemma-7b
google/gemma-2b
google/gemma-7b
google/recurrentgemma-2b
ibm/granite-34b-code-instruct
ibm/granite-8b-code-instruct
mediatek/breeze-7b-instruct
meta/codellama-70b
meta/llama2-70b
meta/llama3-70b-instruct
meta/llama3-8b-instruct
microsoft/phi-3-medium-4k-instruct
microsoft/phi-3-mini-128k-instruct
microsoft/phi-3-mini-4k-instruct
microsoft/phi-3-small-128k-instruct
microsoft/phi-3-small-8k-instruct
mistralai/codestral-22b-instruct-v0.1
mistralai/mistral-7b-instruct-v0.2
mistralai/mistral-7b-instruct-v0.3
mistralai/mistral-large
mistralai/mixtral-8x22b-instruct-v0.1
mistralai/mixtral-8x22b-v0.1
mistralai/mixtral-8x7b-instruct-v0.1
nvidia/embed-qa-4
nvidia/nemotron-4-340b-instruct
nvidia/nv-embed-v1
seallms/seallm-7b-v2.5
snowflake/arctic
snowflake/arctic-embed-l
thudm/chatglm3-6b
upstage/solar-10.7b-instruct
writer/palmyra-med-70b
writer/palmyra-med-70b-32k
查询可以参考:查看可用模型的小技巧+小工具