Two weeks since the Optus outage, documents show backroom scrambling and urgent meetings occurred as the emergency played out

0
12


In the two weeks since Optus suffered a devastating nationwide outage of its mobile and internet services, the company has shifted the blame to other organisations, attempted to calm its irate customers, and seen its CEO resign.

As Australia’s second-largest telco and its Singaporean parent company Singtel attempt to mend Optus’s reputation, submissions to a Senate inquiry into the outage have told us more about the backroom scrambles and urgent meetings which unfolded as Optus tried to relieve its major headache.

The company said 150 engineers and technicians formed the “core group of personnel” who worked to fix the outage, while 250 other workers and five international companies also provided support.

Let’s take a look at how it all unfolded on Monday, November 8, and the aftermath that followed.

All times are in Australian Eastern Daylight Time.

4:05am: The Optus network crashes

On the morning of Wednesday, November 8, around 10 million Optus customers and 400,000 businesses began their mornings without mobile or internet service.

Optus said its post-incident analysis had found the loss of connectivity began at around 4:05am.

At that time, the company did not know what had caused the outage.

From 4:05am: Engineers try to find the issue

Optus said its engineers worked with several hypotheses to try to restore its network as soon as the outage began.

The company said this work included, among other things:

  • Rolling back earlier changes to the network, to confirm they weren’t the cause
  • Checking if Optus had been intentionally overloaded by a cyber attack (it appeared it had not)
  • Examining whether the issue was due to problems with network authentication

Optus said it also began to examine media inquiries from 4:27am.

As engineers continued their work, some customers attempted to contact Optus by going to their local store, or contacting Optus helplines if they had access to other service providers.

A group of customers surround a retail worker inside of a shop

Some customers demanded answers from staff at Optus stores during the outage.(
ABC News: Cathy Border
)

6:33am: Optus releases its first statement

The company’s first statement said it was aware of the outage and was working to restore its services as quickly as possible.

Further messages were also posted on Optus social media accounts and the organisation’s website.

7am: The Department of Home Affairs contacts Optus

The Australian government’s Department of Home Affairs, the regulator for telecommunications safety, said some of its senior officials reached out to Optus via encrypted messaging app Signal to offer their assistance.

The department said its national cybersecurity coordinator remained “engaged”, in case Optus eventually discovered that a cyber attack had in fact occurred.

7:02am: Optus notifies the ACMA about emergency call issues

The Australian Communications and Media Authority (ACMA), which regulates broadcasting, radio and telecommunications, and whose remit includes access to triple-0 calls, said it received a notification from Optus at 7:02am that its network outage was “adversely affecting the carriage of emergency calls over the Optus network”.

Optus released statements saying that triple-0 services could not be contacted using Optus landlines, and there were some issues with mobile services when they tried to use other networks to access triple-0 services.

An Optus store sign seen through a border of leaves.

Optus says such a major outage is a rare occurrence.(
ABC News: Daniel Irvine
)

9:30am: The federal government schedules a meeting

The Department of Home Affairs said it sent an email to Optus shortly after 9:30am, inviting it to provide an update on the outage to Australian state and territory governments.

Some of the states and territories were experiencing communications issues within their departments, including some local health and transport authorities.

The meeting with Optus would be coordinated through something called the National Coordination Mechanism (NCM), which was implemented during the COVID-19 pandemic, and was scheduled to take place at 2pm.

10:21am: Optus appears to find the problem, and fixes begin

Optus said it was at this time that its leading hypothesis — a sudden flood of new internet routing information, which had caused some parts of its network to disconnect themselves — was identified as the likely cause of the network crash.

The company said it set about resetting parts of the network by both remotely and physically rebooting and reconnecting some parts.

This system reset is said to have included more than 100 devices in 14 sites across the country.

Optus said it then began to “carefully and methodically re-introducing traffic onto the mobile data and voice core to avoid a signalling surge on the network”.

10:30am: The Optus CEO speaks on ABC Radio

Optus CEO Kelly Bayer Rosmarin was interviewed in a voice-over-internet call with ABC Radio Sydney, using WhatsApp.

She said there was “no indication” of a cyber incident, and the company was working on “a number of hypotheses” to find the cause of the outage.

“The teams are trying many different angles and we will not rest until the service is back up for our customers,” she said.

Ms Bayer Rosmarin added that it was “highly unlikely” that the outage was caused by an overnight software update.

The chief executive took part in 10 other media interviews on the day.

Loading…

10:38am: Customers slowly begin to come back online

Optus said services began to come back for some customers about 20 minutes after the apparent problem was discovered and fixes began to be implemented.

11am: The communications minister gives a press conference

Communications Minister Michelle Rowland held a press conference in Sydney, and told waiting media that she understood the cause of the outage was “deep in the core” of the Optus network.

“The core network basically encompasses everything from routing to electronics. So it is a fault that is quite fundamental to the network,” she said.

“But my understanding, having just recently spoken again to the CEO, is that a number of problems have been identified and that Optus continues to work on this.”

Loading…

11:32am: Optus updates the Telecommunications Ombudsman

The Telecommunications Industry Ombudsman (TIO), which deals with phone and internet disputes for consumers and small businesses, said it had reached out to Optus for updates at 8:12am and received a reply at 11:32am which included, among other things, details of some contact avenues for consumers.

Optus would later set up a team which the TIO could refer consumers to if they wanted to escalate their complaints.

12pm: Half of Optus network sites are back online

Optus said that by 12pm, 56 per cent of its Radio Access Network sites had been restored.

These sites allow people’s individual devices to connect to the wider Optus network, and the internet.

A wide shot, looking to the sky, of a large mobile communications tower with a ladder running through its centre.

Optus says half of its network sites were back online by 12pm AEDT on the day of the outage.(Supplied: Optus)

2pm: Optus attends the meeting with federal and state governments

Optus representatives attended the meeting set up by the Department of Home Affairs under the National Coordination Mechanism. The ACMA, which regulates access to triple-0 calls, was also in attendance.

Home Affairs said Optus briefed officials on what it knew about the outage, but “a number of details remained unclear” about the exact cause of the incident.

By this time, Optus said 98 per cent of its customers had been reconnected to its network.

4pm: Optus declares the outage over

Optus declared at 4pm that its network outage had ended, and said a technical fault was to blame.



Source link