End-to-End Tests

This document provides instructions on how to run the end-to-end tests.

Overview

The end-to-end tests validate the Gateway API Inference Extension as a whole. They run against a Kubernetes cluster and use the Ginkgo testing framework to verify that the extension behaves as expected.

Prerequisites

- Go installed on your machine.
- Make installed to run the end-to-end test target.
- A Hugging Face Hub token with access to the meta-llama/Llama-3.1-8B-Instruct model.
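
The prerequisites above can be checked before starting with a short pre-flight script. This is a minimal sketch, not part of the repository: `check_prereqs` is a hypothetical helper name, and it only confirms that `go` and `make` are on the `PATH` and that `HF_TOKEN` is non-empty. It cannot confirm that the token actually has access to the model.

```shell
# Pre-flight check for the prerequisites listed above.
# check_prereqs is a hypothetical helper, not something the repository provides.
check_prereqs() {
  prereq_missing=0

  # Go and Make must be available on the PATH.
  for tool in go make; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing tool: $tool"
      prereq_missing=1
    fi
  done

  # The Hugging Face Hub token is read from the HF_TOKEN environment variable.
  if [ -z "${HF_TOKEN:-}" ]; then
    echo "HF_TOKEN is not set"
    prereq_missing=1
  fi

  # Return 0 when everything was found, 1 otherwise.
  return "$prereq_missing"
}
```

One way to use it is `check_prereqs && make test-e2e`, so the test target only starts when nothing is reported missing.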

Running the End-to-End Tests

Follow these steps to run the end-to-end tests:

Clone the Repository: Clone the gateway-api-inference-extension repository and change into its directory:
```
git clone https://github.com/kubernetes-sigs/gateway-api-inference-extension.git && cd gateway-api-inference-extension
```

Export Your Hugging Face Hub Token: The token is required to run the test model server:
```
export HF_TOKEN=<MY_HF_TOKEN>
```
(Optional) Set the Test Namespace: By default, the e2e tests create resources in the inf-ext-e2e namespace. To use a different namespace, set the following environment variable:
```
export E2E_NS=<MY_NS>
```
Run the Tests: Run the test-e2e target:
```
make test-e2e
```
The test suite prints details for each step. Note that the vllm-llama3-8b-instruct-pool model server deployment may take several minutes to report an Available=True status because of the time required for bootstrapping.
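
While the tests run, the model server's progress can be watched from a second terminal. The sketch below rests on several assumptions: that `kubectl` is installed and pointed at the same cluster (it is not listed as a prerequisite), that the Deployment object is named vllm-llama3-8b-instruct-pool as the note above suggests, and that a 10 minute timeout is enough. The helper names `e2e_namespace` and `wait_for_model_server` are hypothetical.

```shell
# Resolve the namespace as described in the steps above:
# use E2E_NS if it is set, otherwise fall back to the default inf-ext-e2e.
e2e_namespace() {
  echo "${E2E_NS:-inf-ext-e2e}"
}

# Block until the model server Deployment reports Available=True.
# Assumptions: the Deployment is named vllm-llama3-8b-instruct-pool,
# and the 10m timeout is an illustrative value, not a documented one.
wait_for_model_server() {
  kubectl wait deployment/vllm-llama3-8b-instruct-pool \
    --namespace "$(e2e_namespace)" \
    --for=condition=Available \
    --timeout=10m
}
```

Calling `wait_for_model_server` returns once the condition is met, or exits non-zero if the timeout elapses first.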
