Today's cybersecurity headlines are brought to you by ThreatPerspective


Ethical Hacking News

The Matrix.org Homeserver Goes Down: A Look into the Causes and Consequences of a Decentralized Messaging Service's Failure


The Matrix.org Homeserver Goes Down: A Look into the Causes and Consequences of a Decentralized Messaging Service's Failure

  • Matrix, a messaging platform, experienced an outage due to a RAID failure on its homeserver.
  • The outage, which started on September 2, resulted in queued messages for users.
  • A routine storage upgrade exercise went badly wrong, causing the RAID failure.
  • The Matrix.org homeserver is backed by a PostgreSQL database, which has experienced issues in the past due to table index corruption.
  • Users who rely on their own homeservers remain unaffected, but those using Matrix.org are unable to send or receive messages until restored.
  • The outage highlights the benefits of decentralized systems and the importance of having backup solutions and contingency plans in place.



  • Matrix, a messaging platform developed by the creators of the iconic Matrix franchise, has recently experienced an outage due to a RAID failure on its homeserver. The incident, which began on September 2 at 1117 UTC, resulted in messages sent to Matrix.org users being queued until the service is back online.

    According to Neil Johnson, chief engineering officer at Element, a company that utilizes the underlying technology of Matrix, the trouble started with a routine storage upgrade exercise that went badly wrong. "A whole series of things happened at exactly the wrong time in unison, which then led to the situation that we see," he said. The RAID failure caused issues with the primary database, leading to problems with message sending and joining rooms.

    The Matrix.org homeserver is backed by a large PostgreSQL database, which has experienced issues in the past due to corruption of part of a table index. In July, this issue caused problems with "rooms" in the system, resulting in failed attempts to join rooms, messages not being sent, and occasional cryptic error messages appearing.

    The team behind Matrix.org reported that they have been cautious when restoring the database, ultimately deciding on a full 55 TB database snapshot restore followed by a replay of 17 hours' worth of traffic. This solution aims to restore the primary database to a state where it can be confidently run as a primary.

    While users who rely on their own homeservers, such as government organizations, remain unaffected, those using Matrix.org as their homeserver are unable to send or receive messages until the service is restored. The organization assured that there would be no data loss and that eventually, all messages will get through.

    The incident highlights the benefits of a decentralized system, where users with their own homeservers are not affected by issues on the main Matrix.org server. This allows organizations like Element to utilize the underlying technology without being impacted by problems on the service level.

    Matrix has become increasingly important in recent years as public and private sector organizations seek to reduce their dependency on centralized messaging services that might not meet sovereignty or privacy requirements. The outage serves as a reminder of the importance of having backup solutions and contingency plans in place, especially for critical systems like the one provided by Matrix.org.

    The incident also underscores the need for organizations to closely monitor and maintain their infrastructure, including regular backups and updates to ensure that issues are addressed promptly before they escalate into full-blown failures. As technology continues to advance at a rapid pace, it is essential that organizations prioritize proactive maintenance and disaster recovery strategies to mitigate potential risks.

    In conclusion, the outage of Matrix.org's homeserver serves as a valuable lesson for those relying on decentralized messaging services. While the incident may seem like a setback for users, it highlights the benefits of having multiple layers of protection in place, such as the ability to utilize one's own homeserver or rely on alternative solutions.

    As the importance of secure and private communication continues to grow, organizations must take proactive steps to ensure that their systems are equipped with robust backup solutions and contingency plans. By doing so, they can minimize downtime and ensure that critical services like Matrix.org remain available to users who rely on them.

    Related Information:
  • https://www.ethicalhackingnews.com/articles/The-Matrixorg-Homeserver-Goes-Down-A-Look-into-the-Causes-and-Consequences-of-a-Decentralized-Messaging-Services-Failure-ehn.shtml

  • https://go.theregister.com/feed/www.theregister.com/2025/09/03/matrixorg_raid_failure/

  • https://www.theregister.com/2025/09/03/matrixorg_raid_failure/

  • https://matrix.org/blog/2025/07/postgres-corruption-postmortem/


  • Published: Wed Sep 3 10:04:50 2025 by llama3.2 3B Q4_K_M













    © Ethical Hacking News . All rights reserved.

    Privacy | Terms of Use | Contact Us