Skip to content

docs: add Redis solution articles ported from internal Confluence#753

Open
chideat wants to merge 2 commits intomainfrom
redis-kb
Open

docs: add Redis solution articles ported from internal Confluence#753
chideat wants to merge 2 commits intomainfrom
redis-kb

Conversation

@chideat
Copy link
Copy Markdown
Collaborator

@chideat chideat commented May 7, 2026

Port 23 troubleshooting and how-to articles for Alauda Cache Service for Redis OSS (redis-operator), filling gaps not covered by docs.alauda.io/redis/5.0/. Topics include sentinel password setup, dangerous-command ACL, slow log, custom commands, RedisInsight, RedisProxyOperator, Navicat client, cluster slot/node recovery, backup template compatibility, RedisShake migration, and an emergency playbook. Each doc carries an applicable-version callout and was reviewed against operator v5.0.x; legacy procedures are clearly scoped or routed to current alternatives.

Port 23 troubleshooting and how-to articles for Alauda Cache Service for Redis OSS
(redis-operator), filling gaps not covered by docs.alauda.io/redis/5.0/. Topics
include sentinel password setup, dangerous-command ACL, slow log, custom commands,
RedisInsight, RedisProxyOperator, Navicat client, cluster slot/node recovery,
backup template compatibility, RedisShake migration, and an emergency playbook.
Each doc carries an applicable-version callout and was reviewed against operator
v5.0.x; legacy procedures are clearly scoped or routed to current alternatives.
Copilot AI review requested due to automatic review settings May 7, 2026 03:23
@coderabbitai
Copy link
Copy Markdown
Contributor

coderabbitai Bot commented May 7, 2026

Warning

Rate limit exceeded

@chideat has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 45 minutes and 57 seconds before requesting another review.

To continue reviewing without waiting, purchase usage credits in the billing tab.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: b4622d75-dc5c-40f8-983d-2c999eda8ef4

📥 Commits

Reviewing files that changed from the base of the PR and between 7ae5a38 and a3575ee.

📒 Files selected for processing (23)
  • docs/en/solutions/ecosystem/redis/Backup_Restore_Template_Compatibility.md
  • docs/en/solutions/ecosystem/redis/How_to_Add_Custom_Redis_Commands.md
  • docs/en/solutions/ecosystem/redis/How_to_Cleanup_Invalid_Cluster_Nodes.md
  • docs/en/solutions/ecosystem/redis/How_to_Configure_Redis_MaxMemory.md
  • docs/en/solutions/ecosystem/redis/How_to_Connect_to_Redis_Sentinel_with_Navicat.md
  • docs/en/solutions/ecosystem/redis/How_to_Deploy_RedisInsight_Web_Console.md
  • docs/en/solutions/ecosystem/redis/How_to_Deploy_Redis_Proxy_with_RedisProxyOperator.md
  • docs/en/solutions/ecosystem/redis/How_to_Fix_Sentinel_Multi_Instance_Merge.md
  • docs/en/solutions/ecosystem/redis/How_to_Manage_Dangerous_Redis_Commands.md
  • docs/en/solutions/ecosystem/redis/How_to_Manually_Remove_Failed_Cluster_Nodes.md
  • docs/en/solutions/ecosystem/redis/How_to_Migrate_Redis_Across_Clusters.md
  • docs/en/solutions/ecosystem/redis/How_to_Rate_Limit_Redis_Traffic.md
  • docs/en/solutions/ecosystem/redis/How_to_Recover_From_Cross_Shard_Master_Corruption.md
  • docs/en/solutions/ecosystem/redis/How_to_Recover_From_Redis_Cluster_Crash.md
  • docs/en/solutions/ecosystem/redis/How_to_Repair_Redis_Cluster_Slot_Anomalies.md
  • docs/en/solutions/ecosystem/redis/How_to_Resolve_Master_Replica_Sync_Failure.md
  • docs/en/solutions/ecosystem/redis/How_to_Run_Redis_as_Root_User.md
  • docs/en/solutions/ecosystem/redis/How_to_Set_Sentinel_Node_Password.md
  • docs/en/solutions/ecosystem/redis/How_to_Trigger_Manual_Sentinel_Failover.md
  • docs/en/solutions/ecosystem/redis/How_to_Troubleshoot_Cluster_Mode_Connection_Errors.md
  • docs/en/solutions/ecosystem/redis/How_to_View_Redis_Slow_Logs.md
  • docs/en/solutions/ecosystem/redis/Redis_Best_Practices.md
  • docs/en/solutions/ecosystem/redis/Redis_Emergency_Response_Playbook.md
✨ Finishing Touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch redis-kb

Tip

💬 Introducing Slack Agent: The best way for teams to turn conversations into code.

Slack Agent is built on CodeRabbit's deep understanding of your code, so your team can collaborate across the entire SDLC without losing context.

  • Generate code and open pull requests
  • Plan features and break down work
  • Investigate incidents and troubleshoot customer tickets together
  • Automate recurring tasks and respond to alerts with triggers
  • Summarize progress and report instantly

Built for teams:

  • Shared memory across your entire org—no repeating context
  • Per-thread sandboxes to safely plan and execute work
  • Governance built-in—scoped access, auditability, and budget controls

One agent for your entire SDLC. Right inside Slack.

👉 Get started


Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR adds a set of Redis OSS “Solutions” knowledge-base articles (ported from internal Confluence) under docs/en/solutions/ecosystem/redis, covering operational troubleshooting, recovery playbooks, and how-to guides for Alauda Cache Service for Redis OSS on Kubernetes.

Changes:

  • Adds an emergency response playbook plus best-practices guidance for Redis Sentinel/Cluster deployments.
  • Adds multiple troubleshooting/how-to articles (cluster slot/node repair, replication sync failures, Sentinel password/failover, maxmemory, slowlog, traffic limiting, migration via RedisShake, RedisInsight, and proxy deployment).
  • Adds a backup/restore compatibility note for parameter templates.

Reviewed changes

Copilot reviewed 23 out of 23 changed files in this pull request and generated 28 comments.

Show a summary per file
File Description
docs/en/solutions/ecosystem/redis/Redis_Emergency_Response_Playbook.md New emergency response runbook for common operator/instance/cluster failure scenarios.
docs/en/solutions/ecosystem/redis/Redis_Best_Practices.md New best-practices and sizing guidance for Sentinel/Cluster mode.
docs/en/solutions/ecosystem/redis/How_to_View_Redis_Slow_Logs.md New slowlog inspection/configuration guide.
docs/en/solutions/ecosystem/redis/How_to_Troubleshoot_Cluster_Mode_Connection_Errors.md New guidance for diagnosing non-cluster-aware client issues.
docs/en/solutions/ecosystem/redis/How_to_Trigger_Manual_Sentinel_Failover.md New procedure for manual Sentinel failover.
docs/en/solutions/ecosystem/redis/How_to_Set_Sentinel_Node_Password.md New procedure for setting/rotating Sentinel-side authentication.
docs/en/solutions/ecosystem/redis/How_to_Run_Redis_as_Root_User.md New guidance for running Redis pods as root for specific storage backends.
docs/en/solutions/ecosystem/redis/How_to_Resolve_Master_Replica_Sync_Failure.md New replication-buffer troubleshooting and tuning procedure.
docs/en/solutions/ecosystem/redis/How_to_Repair_Redis_Cluster_Slot_Anomalies.md New manual slot repair procedures for Redis Cluster.
docs/en/solutions/ecosystem/redis/How_to_Recover_From_Redis_Cluster_Crash.md New recovery procedure for corrupted/missing cluster state (nodes.conf).
docs/en/solutions/ecosystem/redis/How_to_Recover_From_Cross_Shard_Master_Corruption.md New procedure for resolving operator reconciliation issues due to shard/pod role misalignment.
docs/en/solutions/ecosystem/redis/How_to_Rate_Limit_Redis_Traffic.md New guidance for throttling/limiting traffic safely (connections, upstream throttling, BigKeys).
docs/en/solutions/ecosystem/redis/How_to_Migrate_Redis_Across_Clusters.md New RedisShake-based cross-cluster migration procedures and version/image matrix.
docs/en/solutions/ecosystem/redis/How_to_Manually_Remove_Failed_Cluster_Nodes.md New helper script and steps for CLUSTER FORGET cleanup of stale nodes.
docs/en/solutions/ecosystem/redis/How_to_Manage_Dangerous_Redis_Commands.md New ACL-based guidance for managing dangerous commands, plus legacy operator methods.
docs/en/solutions/ecosystem/redis/How_to_Fix_Sentinel_Multi_Instance_Merge.md New recovery guide for legacy Sentinel “merge via IP recycling” failure mode.
docs/en/solutions/ecosystem/redis/How_to_Deploy_RedisInsight_Web_Console.md New Kubernetes manifest and connection notes for RedisInsight.
docs/en/solutions/ecosystem/redis/How_to_Deploy_Redis_Proxy_with_RedisProxyOperator.md New guide for deploying Predixy-based proxy via redis-proxy-operator.
docs/en/solutions/ecosystem/redis/How_to_Connect_to_Redis_Sentinel_with_Navicat.md New client configuration guide for Navicat with Sentinel.
docs/en/solutions/ecosystem/redis/How_to_Configure_Redis_MaxMemory.md New guide for overriding operator maxmemory defaults (runtime vs customConfig).
docs/en/solutions/ecosystem/redis/How_to_Cleanup_Invalid_Cluster_Nodes.md New procedure for orphaned node cleanup after pod IP recycling.
docs/en/solutions/ecosystem/redis/How_to_Add_Custom_Redis_Commands.md New legacy guidance for re-enabling commands via rename-command (pre-ACL).
docs/en/solutions/ecosystem/redis/Backup_Restore_Template_Compatibility.md New compatibility matrix + workaround for restoring RDB backups into AOF-template instances.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

products:
- Alauda Application Services
kind:
- Solution
products:
- Alauda Application Services
kind:
- Solution
products:
- Alauda Application Services
kind:
- Solution
products:
- Alauda Application Services
kind:
- Solution
products:
- Alauda Application Services
kind:
- Solution
Comment on lines +84 to +85
| `slowlog-log-slower-than` | Minimum execution time (in microseconds) for a command to be recorded as a slow log entry. | Set to `0` to log every command. Set to `-1` to disable slow log recording. |
| `slowlog-max-len` | Maximum number of entries retained in the slow log. When the limit is reached, the oldest entry is removed. | Default: `0` (no entries retained). Set to a positive value such as `128` or `1024` for production use. |
Comment on lines +25 to +27
```bash
redis-cli -h <redis-host> -p <redis-port> -a <password>
```
Comment on lines +74 to +87
### 1. Create the Restore Instance With AOF Disabled

When you create the new Redis instance for the restore, override the parameter template to set `appendonly: "no"`. This allows Redis to load `dump.rdb` on startup.

For example, on a `RedisFailover` resource:

```yaml
spec:
redis:
customConfig:
appendonly: "no"
restore:
backupName: <backup-name>
```
| Pub/Sub | `psubscribe`, `publish`, `pubsub`, `punsubscribe`, `subscribe`, `unsubscribe` | `spublish`, `ssubscribe`, `sunsubscribe` |
| Stream | | `xacl`, `xadd`, `xautoclaim`, `xclaim`, `xdel`, `xgroup`, `xinfo`, `xlen`, `xpending`, `xrange`, `xread`, `xreadgroup`, `xrevrange`, `xsetid`, `xtrim` |
| Transaction (not supported in Cluster mode) | `discard`, `exec`, `multi`, `unwatch`, `watch` | |
| Server Management | `command` (proxy), `config` (proxy), `info` (proxy) | `acl`, `bgrewrite`, `bgsave`, `command`, `config`, `dbsize`, `failover`, `flushall`, `flushdb`, `info`, `lastsave`, `latency`, `lolwut`, `memory`, `module`, `monitor`, `psync`, `replconf`, `replicaof`, `role`, `save`, `shutdown`, `slaveof`, `slowlog`, `swapdb`, `sync`, `time` |
- Solution
---

# Resolve Master-Replica Sync Failure
Mark each doc with the platform major(s) it applies to:
- 4.x only: features that require operator >= 3.18 (sentinel password,
  RedisProxyOperator, current best practices)
- 3.x only: legacy-bound docs (rename-command for <= 3.15, sentinel
  multi-instance merge unaffected on 3.18+/Redis 6+, RedisShake migration
  whose image table caps at 3.16.2)
- 3.x and 4.x: generic procedures (slow log, cluster slot ops, manual
  failover, recovery playbooks) that work identically on both majors
@chideat chideat deployed to translate May 7, 2026 03:35 — with GitHub Actions Active
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants