My guess is this is all due to CloudWatch Logs PutLogEvents failures.

By default, a Docker container configured with the awslogs driver runs in "blocking" mode. As the container writes logs, Docker buffers them and pushes them to CloudWatch Logs frequently. If the container logs faster than the buffer can drain, writes to stdout/stderr block and the container freezes on the logging call. If PutLogEvents is failing, buffers are probably filling up and freezing containers. I assume most of AWS uses its own logging system, which could cause these large, intermittent failures.

If you're okay dropping logs, add something like this to the container logging definition:

    "max-buffer-size": "25m",
    "mode": "non-blocking"
It seems to have cascaded from AWS Kinesis...

[03:59 PM PDT] We can confirm increased error rates and latencies for Kinesis APIs within the US-EAST-1 Region. We have identified the root cause and are actively working to resolve the issue. As a result of this issue, other services, such as CloudWatch, are also experiencing increased error rates and delayed CloudWatch log delivery. We will continue to keep you updated as we make progress in resolving the issue.

39 affected services listed:

AWS Application Migration Service
AWS Cloud9
AWS CloudShell
AWS CloudTrail
AWS CodeBuild
AWS DataSync
AWS Elemental
AWS Glue
AWS IAM Identity Center
AWS Identity and Access Management
AWS IoT Analytics
AWS IoT Device Defender
AWS IoT Device Management
AWS IoT Events
AWS IoT SiteWise
AWS IoT TwinMaker
AWS License Manager
AWS Organizations
AWS Step Functions
AWS Transfer Family
Amazon API Gateway
Amazon AppStream 2.0
Amazon CloudSearch
Amazon CloudWatch
Amazon Connect
Amazon EMR Serverless
Amazon Elastic Container Service
Amazon Kinesis Analytics
Amazon Kinesis Data Streams
Amazon Kinesis Firehose
Amazon Location Service
Amazon Managed Grafana
Amazon Managed Service for Prometheus
Amazon Managed Workflows for Apache Airflow
Amazon OpenSearch Service
Amazon Redshift
Amazon Simple Queue Service
Amazon Simple Storage Service
Amazon WorkSpaces
This is a bigger deal than the 'degraded' status implies. SQS reads have basically ground to a halt, which is causing massive slowdowns where I am, and the logging issues are causing task timeouts.
Our accounting system Xero is down, with a reference to AWS on their status page. Related to this, I assume.

https://status.xero.com/