Runbook: Validation Error - Partner Queue Full
| Service | CM-HUB |
|---|---|
| Owner Team slack handle | @bnl-dev-bali |
| Team's Slack Channel | #bnl-teams-b |
| Alert Channel | #bnl-critical-alerts |
Table of Contents
Important Links
1. Triage
Quickly confirm if the Triplalink API is experiencing a temporary bottleneck.
- Check the specific error message in Slack:
- Does the error include the text:
"Please wait some time. XX request are waiting in the queue"? - If yes, this is a partner-side rate-limiting/congestion issue.
- Does the error include the text:
- Identify the Scope:
- Is this happening for multiple Hotel IDs (e.g.,
HT25005870)? Check the#bnl-critical-alertschannel for a flood of similar messages.
- Is this happening for multiple Hotel IDs (e.g.,
- Check Partner Response:
- Look at the
ShortTextfield in the error. If the number of requests in the queue is increasing over time, the partner system is not clearing its backlog.
- Look at the
2. Decision Point
- IF the error explicitly states "requests are waiting in the queue"
- ➡️ Go to: True Incident
3. False Alarm
No false alarm for this issue
4. True Incident
The partner API is currently saturated and cannot process further ARI updates, which may lead to inventory discrepancies.
4.1. Recover the System
Follow these steps to ensure data synchronization:
Diagnostic Steps:
- Locate the failed ARI notification in the Slack channel.
- Verify the error message confirms the queue limit has been reached.
Remediation Plan:
- Initial Action: Click the Retry button on the Slack message to resend the ARI data from CM-HUB to Triplalink.
- If Error Persists: If after retry still receivng a "Queue Full" error in slack, wait for 10 minutes before attempting to click retry again. This allows the partner's system time to process its existing queue.
Verification:
- Monitor the logs/Slack for that specific Hotel ID.
- Verification is successful if no further errors occur for 15 minutes after the last successful retry click.
4.2. Clean up
Once the system appears stable, perform the following housekeeping tasks:
- Acknowledge Notification: Add a checkmark emoji (✅) to the Slack notification/alert message to indicate that the error has been addressed.
- Final Monitoring: Continuously monitor the Slack notification channel for 30 minutes to ensure no new queue errors appear for the affected Hotel ID or Partner.