  1. Add the Glade Core Component:
    • Open your player character Blueprint.
    • Click + Add in the Components panel and search for GladeCoreComponent. Add it to the character. This will automatically add all the necessary sub-components that are configurable within this component.
  2. Configure the LLM Component:
    • In the Components panel, select the GladeCore LLMComponent that was just added.
      • Under the Chat category, assign your UChatInputWidget Blueprint to the ChatInputWidgetClass property.
      • Under the LLM category, you will find the LLMServiceManager instance. This is where you configure the core AI service.
  3. Set your TTS Provider:
    • Set the TTSProvider to Piper (local option) or ElevenLabs (cloud option).
    UE Piper TTS Provider Selection
    • If using Piper, you will need to import Piper models. Navigate to Edit → Project Settings; you will find Glade Core Settings under the Game section.
    UE Project Settings Glade Core Settings
    • Under Piper Voice Downloads, you can download preset Piper models or import custom ones.
    UE Piper Voice Download
    • Below that, a status section shows whether your current download is in progress or idle. Lastly, there is a custom voice importer for any other voices you may want to bring in, whether custom-trained voices of your own or models from other open-source projects.
    UE Custom Piper Voice Importer
    • As long as you have the two files for the model, the .onnx model file and its config.json, you can import the custom model.
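    The two-file requirement can be checked before importing. The sketch below is a hypothetical helper (not part of the Glade Core plugin) that locates the .onnx/config.json pair in a folder, assuming both files sit in the same directory:

    ```python
    from pathlib import Path

    def find_piper_model_pair(model_dir: str) -> tuple[Path, Path]:
        """Locate the .onnx model and its config.json in a directory.

        Raises FileNotFoundError if either file is missing, mirroring the
        two-file requirement of the custom voice importer.
        """
        folder = Path(model_dir)
        onnx_files = sorted(folder.glob("*.onnx"))
        config = folder / "config.json"
        if not onnx_files:
            raise FileNotFoundError(f"No .onnx model found in {folder}")
        if not config.is_file():
            raise FileNotFoundError(f"No config.json found in {folder}")
        return onnx_files[0], config
    ```

    A check like this catches a missing or misnamed config file early, before the editor-side import fails.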
    • If using ElevenLabs, select ElevenLabs as the TTS Provider and enter your ElevenLabs Voice ID string here. Note that this is a cloud approach requiring an API key, rather than a fully local approach.
    UE Eleven Labs TTS Manager
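    For context on what the plugin does with the Voice ID, the ElevenLabs text-to-speech REST endpoint embeds it in the request URL and authenticates with an `xi-api-key` header. The sketch below only assembles those pieces (it sends nothing); the function name is hypothetical:

    ```python
    def build_elevenlabs_tts_request(voice_id: str, api_key: str, text: str) -> dict:
        """Assemble the URL, headers, and JSON body for an ElevenLabs
        text-to-speech call; sending it (e.g. with requests.post) is left
        to the caller."""
        return {
            "url": f"https://api.elevenlabs.io/v1/text-to-speech/{voice_id}",
            "headers": {
                "xi-api-key": api_key,  # your ElevenLabs API key
                "Content-Type": "application/json",
            },
            "json": {"text": text},
        }
    ```

    This is why the Voice ID alone is enough to configure the provider: the rest of the request shape is fixed by the API.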
  4. GPU Layer Offloading
    • In the LLMServiceManager, the GPU Layers integer controls how many model layers are offloaded to the GPU. This lets you tune VRAM usage vs. performance.
    • Values:
      • 999 (default): All layers on GPU - maximum performance, requires enough VRAM for the full model
      • 0: CPU only - no GPU used, slowest but works on any machine
      • 1-N: Partial offload - first N layers on GPU, remainder on CPU (useful for limited VRAM)
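The mapping above can be sketched as a small function (a hypothetical helper for illustration; the plugin itself exposes only the GPU Layers integer):

```python
def layers_on_gpu(gpu_layers_setting: int, total_model_layers: int) -> int:
    """Resolve the GPU Layers setting to the number of layers offloaded.

    999 (or any value >= the model's layer count) puts the whole model
    on the GPU, 0 keeps everything on the CPU, and values in between
    split the model between GPU and CPU.
    """
    if gpu_layers_setting <= 0:
        return 0  # CPU only
    return min(gpu_layers_setting, total_model_layers)  # clamp to model size

# e.g. for a 32-layer model:
# layers_on_gpu(999, 32) -> 32  (all layers on GPU)
# layers_on_gpu(0, 32)   -> 0   (CPU only)
# layers_on_gpu(20, 32)  -> 20  (partial offload)
```

When tuning for limited VRAM, lower the value until the model loads without exhausting GPU memory; each layer moved to the CPU reduces VRAM usage at some cost in speed.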
Below is an example image of a configured LLMComponent.
LLM Component Example
Below is an example image of a configured Speech Chat Component.
Speech Chat Component Example
Below is an example image of where to enable or disable Speech to Text and Text to Speech.
Enable Disable TTS STT