# Playground

### Arkane Cloud playground

Arkane Cloud provides a *model playground* where users can test various AI models [available in Arkane Cloud Playground](https://console.arkanecloud.com/playground) through a web interface without coding.

### How to use the playground

1. On the [Models](https://console.arkanecloud.com/models) page, choose a model and click **Deploy** on its card.
2. In **System prompt**, enter instructions that the model should follow when generating output. You can include examples of good in-context learning outputs (few-shot prompting). For example:

   > Add a fun fact about trains to the end of each response. The fun fact may not be related to what I'm asking you about. Here is an example:
   >
   > \- User: What is the capital of Croatia?\
   > \- You: The capital of Croatia is Zagreb. By the way, did you know that the first railroad in New England was powered by horses?
3. Start chatting to the model.

### Model parameters

The playground allows you to set basic sampling parameters for the model you have chosen.

For more parameters supported by the [inference API](/arkane-cloud/api-reference/api-reference.md), refer to the [vLLM documentation](https://docs.vllm.ai/en/stable/dev/sampling_params.html#vllm.SamplingParams).

#### Temperature

* **Affects**: Output randomness
* **Models**: All
* **Type**: Number
* **Range of values**: From 0 to 1 for meta-llama/Meta-Llama-3, from 0 to 2 for models in other families
* **Default value**: From 0.3 to 0.7, depending on the model

Temperature determines how "hot-headed" the model predictions are. The higher the temperature, the more random and less deterministic and conservative the output.

> For example, a 0.8 temperature makes outputs more creative and random than a 0.5.

#### Max tokens

* **Affects**: Computational cost
* **Models**: All
* **Type**: Number
* **Range of values**: Depends on the model (Usually between 1 to 4096 tokens)
* **Default value**: Depends on the model

The maximum number of tokens that the model generates.&#x20;

#### Presence penalty

* **Affects**: Output randomness
* **Models**: All
* **Type**: Number
* **Range of values**: From −2 to 2
* **Default value**: 0

The presence penalty is applied to new tokens that have previously appeared in the output. Positive values penalize such tokens and negative values favor them.

#### Top-p threshold

* **Affects**: Output randomness
* **Models**: All
* **Type**: Number
* **Range of values**: From 0 to 1
* **Default value**: From 0.9 to 1, depending on the model

In *top-p sampling*, also known as *nucleus sampling*, the model considers only the most probable tokens whose combined probability mass is equal to the specified threshold.

> For example, with a threshold of 0.1, only the tokens that comprise the top 10% of the probability mass are considered.

To consider all tokens, set the threshold to 1.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.arkanecloud.com/arkane-cloud/inference/playground.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
