I am implementing various rate checks and slowdowns to make sure we do not use up all of our allocated VCU or USD.
The logic tries to calculate the application's request rate from the remaining VCU or USD, but I am missing one piece of information: how much time is left until the next epoch, e.g. "nextEpochBegins": "2025-06-01T00:00:00.000Z".
Currently we have set it to a static time, but it would be really nice to have it in the API Key Rate Limits response so that we can calculate the application request rate limits dynamically.
Example:
{
  "data": {
    "apiTier": {
      "id": "paid",
      "isCharged": true
    },
    "accessPermitted": true,
    "keyExpiration": "2025-06-01T00:00:00.000Z",
    "nextEpochBegins": "2025-06-01T00:00:00.000Z",
    "balances": {
      "VCU": 100.023,
      "USD": 50.23
    },
    "rateLimits": [
      {
        "apiModelId": "llama-3.3-70b",
        "rateLimits": [
          {
            "amount": 100,
            "type": "RPM"
          }
        ]
      }
    ]
  }
}
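
To illustrate how the requested field would be used, here is a rough sketch of the dynamic calculation I have in mind. It assumes the response contains "nextEpochBegins" as proposed above. The function names and the cost_per_request argument (my own estimate of what one request costs in VCU or USD) are hypothetical and not part of the API.

```python
from datetime import datetime, timezone


def parse_iso_utc(ts):
    # "2025-06-01T00:00:00.000Z" -> timezone-aware datetime.
    # The trailing "Z" is replaced so this also works on Python < 3.11.
    return datetime.fromisoformat(ts.replace("Z", "+00:00"))


def sustainable_rpm(response, cost_per_request, unit="VCU", model_id=None, now=None):
    """Requests per minute that spread the remaining balance over the epoch.

    cost_per_request is an assumed, caller-supplied estimate in `unit`.
    """
    data = response["data"]
    now = now or datetime.now(timezone.utc)
    minutes_left = (parse_iso_utc(data["nextEpochBegins"]) - now).total_seconds() / 60
    # Clamp to one minute so a stale or just-expired epoch cannot divide by zero.
    minutes_left = max(minutes_left, 1.0)
    rpm = data["balances"][unit] / cost_per_request / minutes_left
    # Never exceed the model's own RPM limit, if one is listed.
    for model in data.get("rateLimits", []):
        if model["apiModelId"] == model_id:
            for limit in model["rateLimits"]:
                if limit["type"] == "RPM":
                    rpm = min(rpm, limit["amount"])
    return rpm
```

With the example response above, 24 hours before the epoch boundary and an assumed cost of 0.01 VCU per request, this gives roughly 6.95 requests per minute for llama-3.3-70b, well below the 100 RPM limit.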
Completed
API
11 months ago

TomB