10000 Update troubleshooting.md by pj-simpson · Pull Request #1631 · codatio/codat-docs · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Update troubleshooting.md #1631

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Jul 14, 2025
Merged

Conversation

pj-simpson
Copy link
Collaborator

Description

There have been recent issues caused by poor understanding of how our webhooks work on both the part of Codat internally & our consumers. The intention of this PR is to root out some of the areas of ambiguity.

  • Exactly what each state that a message can be in means
  • How the recovery functionality interacts with the retry policy
  • How the disabling functionality interacts with the retry policy
  • Best practises around Idempotency

Type of change

Please delete options that are not relevant.

  • updating existing

Copy link
vercel bot commented Jul 14, 2025

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
codat-docs ✅ Ready (Inspect) Visit Preview 💬 Add feedback Jul 14, 2025 1:09pm

@pj-simpson pj-simpson enabled auto-merge July 14, 2025 13:09
Copy link

Overall readability score: 58.05 (🟢 +0)

File Readability
troubleshooting.md 67.56 (🟢 +0)
View detailed metrics

🟢 - Shows an increase in readability
🔴 - Shows a decrease in readability

File Readability FRE GF ARI CLI DCRS
troubleshooting.md 67.56 64.71 10 8.7 8.81 8.59
  🟢 +0 🟢 +0 🟢 +0 🟢 +0 🟢 +0 🟢 +0

Averages:

  Readability FRE GF ARI CLI DCRS
Average 58.05 48.72 10.34 11.54 12.09 7.83
  🟢 +0 🟢 +0 🟢 +0 🟢 +0 🟢 +0 🟢 +0
View metric targets
Metric Range Ideal score
Flesch Reading Ease 100 (very easy read) to 0 (extremely difficult read) 60
Gunning Fog 6 (very easy read) to 17 (extremely difficult read) 8 or less
Auto. Read. Index 6 (very easy read) to 14 (extremely difficult read) 8 or less
Coleman Liau Index 6 (very easy read) to 17 (extremely difficult read) 8 or less
Dale-Chall Readability 4.9 (very easy read) to 9.9 (extremely difficult read) 6.9 or less

Copy link
Collaborator
@dcoplowe dcoplowe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some minor tweaks. Is it possible to either follow up with a section in the docs on rate limiting or as part of this PR?

@@ -56,6 +67,12 @@ Then, click the triple-dot menu on the right and choose one of the applicable op

For more granular date control, you can scroll to the endpoint's message attempts, click the triple-dot options menu of a specific message, and choose **Replay > Replay all failed messages since this time**.

During the recovery of mutiple messages, all messages will be sent at once with some jitter applied in order to prevent overloading the webhook consumer endpoint. If your system has rate-limiting in place, the number of messages to recover may be an important consideration to avoid further failures. Please reach out to Codat Support if unsure.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@pj-simpson Svix supports rate limiting endpoints. Could you ask them and confirm that retries will also adhere to end point rate limits? If so, they can control this in the portal. Can we add a rate limit section after Webhoook security on the following page: /using-the-api/webhooks/create-consumer?

image

Comment on lines +32 to +35
- "Success" indicates that there was at least one attempt for that message that succeeded against it's endpoint.
- "Failure" indicates that all attempts were exhausted, and none of them succeeded.
- "Attempting" indicates that at least one attempt has been sent and there are further attempts scheduled as part of the retry policy.
- "Sending" indicates that the process of sending the webhook has begun but there have been no delivery attempts yet.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we change these to bold font? Success

@pj-simpson pj-simpson merged commit d151e30 into main Jul 14, 2025
4 of 7 checks passed
@pj-simpson pj-simpson deleted the pj-simpson-webhook-troubleshooting branch July 14, 2025 15:17

An event that fails three times and then succeeds will be delivered roughly 35 minutes and 5 seconds after the initial attempt.

A webhook consumer endpoint which is disabled after the 5 min retry, but then re-enabled approx 3 hours after that, should have 3 more message attempts scheduled, (5 hours, 10 hours, Additional 10 hours).
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is that 5 hours from the first message attempt or from when the consumer was reenabled?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants
0