On a VM (Node.js)
First, purchase a VM from a cloud hosting provider, then run the commands below that match the OS you installed. Make sure the machine has at least 8GB of VRAM available.
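One way to check the VRAM requirement on an NVIDIA machine is to read the total memory reported by nvidia-smi. The sketch below assumes nvidia-smi is installed and on the PATH. The helper names (parseVramMiB, hasEnoughVram, readVramOutput) are hypothetical, and the 8192 MiB threshold is an assumption that may need lowering, since some cards sold as 8GB report slightly less.

```javascript
const { execFileSync } = require('node:child_process');

// Parses the output of:
//   nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits
// which prints one total-memory value in MiB per GPU, one GPU per line.
function parseVramMiB(output) {
  return output
    .split(/\r?\n/)
    .map((line) => line.trim())
    .filter((line) => line !== '')
    .map(Number);
}

// Returns true if at least one GPU reports requiredMiB or more.
// The default of 8 * 1024 MiB is an assumed reading of "8GB".
function hasEnoughVram(output, requiredMiB = 8 * 1024) {
  return parseVramMiB(output).some((mib) => mib >= requiredMiB);
}

// Runs nvidia-smi and returns its raw text output.
// Throws if nvidia-smi is missing or no NVIDIA driver is loaded.
function readVramOutput() {
  return execFileSync(
    'nvidia-smi',
    ['--query-gpu=memory.total', '--format=csv,noheader,nounits'],
    { encoding: 'utf8' }
  );
}
```

On the VM, hasEnoughVram(readVramOutput()) gives a quick yes/no answer before you install anything else.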
Fullmetal Agent Example
This project provides a Fullmetal Agent written in Node.js that interacts with the Fullmetal API.
The main workflow of this application is as follows:
Register itself with the Fullmetal API
Receive prompts from the Fullmetal API
Generate a response to each received prompt using an LLM running locally
Send the response to the Fullmetal API
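The four steps above can be sketched as a small event-driven loop. This is only an illustration of the control flow, not the real Fullmetal client: FakeFullmetalApi, runAgent, the 'prompt' event name, and the payload shape are all hypothetical, and the generate callback stands in for the local LLM call (which would normally be asynchronous).

```javascript
const { EventEmitter } = require('node:events');

// Hypothetical stand-in for the remote API. It only records what the
// agent sends so the flow can be observed without a network connection.
class FakeFullmetalApi extends EventEmitter {
  constructor() {
    super();
    this.registered = [];
    this.responses = [];
  }

  register(agentInfo) {
    this.registered.push(agentInfo);
  }

  sendResponse(promptId, text) {
    this.responses.push({ promptId, text });
  }
}

// Wires an agent to the API. `generate` is a placeholder for the
// locally running model; here it is a plain synchronous function.
function runAgent(api, agentInfo, generate) {
  api.register(agentInfo);                 // 1. register with the API
  api.on('prompt', ({ id, prompt }) => {   // 2. receive a prompt
    const answer = generate(prompt);       // 3. generate a response locally
    api.sendResponse(id, answer);          // 4. send the response back
  });
}
```

Keeping the model call behind a single generate function means the API wiring stays the same if the local model is swapped for another one.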
Installation
Clone the following repository
Install the project and set your configuration parameters
Obtain your FULLMETAL_API_KEY by following the instructions mentioned here.
Create a .env file in the root folder and add the following keys. All of these keys are required; if any is left blank, your agent may not work properly.
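Because a blank key can break the agent, it may help to check the .env contents before starting. The sketch below is a minimal check under stated assumptions: parseEnv and findBlankKeys are hypothetical helpers, the parser handles only plain KEY=VALUE lines (no quoting or multi-line values), and FULLMETAL_API_KEY is the only key name taken from this page. Pass in whichever other keys your configuration requires.

```javascript
// Parses simple KEY=VALUE lines. Blank lines and lines starting
// with # are skipped; lines without "=" are ignored.
function parseEnv(text) {
  const values = {};
  for (const rawLine of text.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === '' || line.startsWith('#')) continue;
    const eq = line.indexOf('=');
    if (eq === -1) continue;
    const key = line.slice(0, eq).trim();
    values[key] = line.slice(eq + 1).trim();
  }
  return values;
}

// Returns the required keys that are missing or have an empty value.
function findBlankKeys(text, requiredKeys) {
  const values = parseEnv(text);
  return requiredKeys.filter((key) => !values[key]);
}
```

For example, reading the file with fs.readFileSync('.env', 'utf8') and passing the result to findBlankKeys with your list of required keys returns an empty array when everything is filled in.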
CUDA SUPPORT
Please make sure you have build-essential (the Debian/Ubuntu compiler toolchain package) installed on your machine.
To install build-essential:
To build node-llama-cpp with CUDA support, run this command inside your project:
Download a GGUF LLM file and save it in the models folder. For example, you can use https://huggingface.co/TheBloke/Wizard-Vicuna-7B-Uncensored-GGUF/resolve/main/Wizard-Vicuna-7B-Uncensored.Q2_K.gguf
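If you prefer to fetch the model from Node.js rather than a browser, the download step could look roughly like the sketch below. It assumes Node.js 18 or later (for the global fetch) and a models folder relative to the current working directory. modelDestination and downloadModel are hypothetical helper names, not part of this project.

```javascript
const fs = require('node:fs');
const path = require('node:path');
const { Readable } = require('node:stream');
const { pipeline } = require('node:stream/promises');

// Builds the local path for a model URL:
// <modelsDir>/<last segment of the URL path>.
function modelDestination(modelUrl, modelsDir = 'models') {
  const fileName = path.posix.basename(new URL(modelUrl).pathname);
  return path.join(modelsDir, decodeURIComponent(fileName));
}

// Streams the file to disk instead of buffering it in memory,
// since GGUF files are typically several gigabytes.
async function downloadModel(modelUrl, modelsDir = 'models') {
  const destination = modelDestination(modelUrl, modelsDir);
  fs.mkdirSync(modelsDir, { recursive: true });
  const response = await fetch(modelUrl); // global fetch: Node.js 18+
  if (!response.ok || !response.body) {
    throw new Error(`Download failed: HTTP ${response.status}`);
  }
  await pipeline(Readable.fromWeb(response.body), fs.createWriteStream(destination));
  return destination;
}
```

Calling downloadModel with the example URL above would save the file under models using the file name from the URL.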
Usage
To run, type: