šŸ«‚ Kinetic - Americas - Extended Maintenance / Outage

And those that haven’t are now fearing it. :thinking:

4 Likes

Justified feeling. We’re SaaS and up and running and I don’t know how many customers were affected. Sure we’re one of the lucky ones who weren’t but it’s still a scary situation to be in.

1 Like

When Epicor tell us they can run Epicor better than we can on-prem but then do this. Makes it hard to believe them.

8 Likes

Is everyone just locked out of Epicor until the maintenance period is complete?
We have access to Epicor, but have lost some data during the maintenance window.

I haven’t seen anyone else complaining about missing data, so I’m was just curious on if we were the only ones.

Thanks.

My quick test is via the web UX … keep getting Bad Gateway on load.
image

This scares me. A lot.

The idea of lost data is unfathomable. I detest using buzzwords but ā€œbest practicesā€ would usually include a full verified backup prior to upgrading a server…hopefully insuring there’s no data lost in the transition.

3 Likes

Just like on-prem, if our app server is running I believe users can log in.

We’re not at the 11:45am second extension without an all clear so in limbo.

unbelievable lies GIF

Eric Wareheim Mind Blown GIF by Tim and Eric

2 Likes

To be clear we’ve only lost records that were changed or added during the maintenance window. But it’s tough when Epicor posts that the maintenance window has been extended.

1 Like

We Are Not Alone GIF by New Amsterdam

image

I don’t get it…if Epicor’s declared a maintenance window, aren’t your environments off-limits until it’s back up?

Am I missing something? I know I’m a little dense but…

EDIT - just read @cmulford 's next message…if they had to roll back to a prior backup instead of the most recent…that will be a nightmare trying to recreate what was lost.

We are back up. Able to login around 6:30am est. Printing started around 11:50. We had some quotes that were entered yesterday that were not in the system this morning. Still trying to get get details on the time they were entered. It appears there was a restore done at some point that rolled back. Hopefully we can more information on that from the cloud team.

1 Like

100% Our first shift starts at 6AM which was supposed to be an hour after the maintenance was scheduled to be completed. We came in with users in system already.

That’s the exact situation I was talking about - you shouldn’t be able to get in your system while they’re doing the maintenance…

3 Likes

There is no easy ā€œdisallow user loginā€ with Epicor that we can get to as SaaS --yet?–. Which would be great even for on-prem.

edited for clarity

1 Like

It’ll be something like a backup fumble. Something not unlike, an automated backup routine specific to this event didn’t produce a recoverable .bak and a standard prior backup was used as a fallback. It’s not great but would limit the scope of the fumble and suggests that the daily automated backups are recoverable.

Migrating any production database across SQL Server updates is a massive pain. At this scale some fallout is absolutely guaranteed with a drop dead switchover. Ideally it would be done incrementally, but I suspect there are far too many client databases sharing each SQL Server for that.

We got off easy, we’re up. Looks like we were down from 2300ish Saturday and back up 0600ish Sunday. @@version says we’re on 16.0.4185.3

1 Like

Howzabout stopping the app server?

5 Likes

That’s fine and dandy until you need Conversion Workbench and have to start the App Server. Granted since this was SQL server no reason it should of been running at all.

1 Like

For all of my fellow SaaS comrades - you’re entitled to a credit under the SLA because of the downtime.

5 Likes