AWS Certification: Kinesis Questions

Amazon Kinesis

Overview
Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities to cost-effectively process streaming data at any scale, along with the flexibility to choose the tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-time data such as video, audio, application logs, website clickstreams, and IoT telemetry data for machine learning, analytics, and other applications. Amazon Kinesis enables you to process and analyze data as it arrives and respond instantly instead of having to wait until all your data is collected before the processing can begin.
1. A company has a set of web servers. It is required to ensure that all the logs from these web servers can be analyzed in real time for any sort of threat detection. Which of the following would assist in this regard?

A. Upload dall the logs to the SQS Service and then use EC2 Instances to scan the logs.

B. Upload the logs to Amazon Kinesis and then analyze the logs accordingly.

C. Upload the logs to CloudTrail and then analyze the logs accordingly.

D. Upload the logs to Glacier and then analyze the logs accordingly.

Answer

B. Upload the logs to Amazon Kinesis and then analyze the logs accordingly.

AWS Documentation provides the following information to support this requirement: Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities to cost-effectively process streaming data at any scale, along with the flexibility to choose the tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-time data such as video, audio, application logs, website clickstreams, and IoT telemetry data for machine learning, analytics, and other applications. For more information on Amazon Kinesis, please refer to the below URL: https://aws.amazon.com/kinesis/


2. A company has an infrastructure that consists of machines which keep sending log information every 5 minutes. The number of these machines can run into thousands and it is required to ensure that the data can be analyzed at a later stage. Which of the following would help in fulfilling this requirement?

A. Use Kinesis Firehose with S3 to take the logs and store them in S3 for further processing.

B. Launch an Elastic Beanstalk application to take the processing job of the logs.

C. Launch an EC2 instance with enough EBS volumes to consume the logs which can be usedfor further processing

D. Use CloudTrail to store all the logs which can be analyzed at a later stage.

Answer

A. Use Kinesis Firehose with S3 to take the logs and store them in S3 for further processing.

Amazon Kinesis Data Firehose is the easiest way to load streaming data into data stores and analytics tools. It can capture, transform, and load streaming data into Amazon S3, Amazon Redshift, Amazon Elasticsearch Service, and Splunk, enabling near real-time analytics with existing business intelligence tools and dashboards you’re already using today.

For more information on Amazon Kinesis firehose, please visit the following URL:

https://aws.amazon.com/kinesis/data-firehose/


3. A company is planning on testing a large set of IoT enabled devices. These devices will be streaming data every second. A proper service needs to be chosen in AWS which could be used to collect and analyze these streams in real time. Which of the following could be used for this purpose?

A. Use AWS EMR to store and process the streams.

B. Use AWS Kinesis streams to process and analyze the data.

C. Use AWS SQS to store the data.

D. Use SNS to store the data.

Answer

B. Use AWS Kinesis streams to process and analyze the data.

Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities to cost-effectively process streaming data at any scale, along with the flexibility to choose the tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-time data such as video, audio, application logs, website clickstreams, and IoT telemetry data for machine learning, analytics, and other applications.

For more information on Amazon Kinesis, please refer to the below URL:

https://aws.amazon.com/kinesis/ Option A: Amazon EMR can be used to process applications with data intensive workloads.

Option B: Amazon Kinesis can be used to store, process and analyse real time streaming data.

Option C: SQS is a fully managed message queuing service that makes it easy to decouple and scale microservices, distributed systems, and serverless applications.

Option D: SNS is a flexible, fully managed pub/sub messaging and mobile notifications service for coordinating the delivery of messages to subscribing endpoints and clients.


4. IOT sensors monitor the number of bags that are handled at an airport. The data gets sent back to a Kinesis stream with default settings. Every alternate day, the data from the stream is sent to S3 for processing. But it is noticed that S3 is not receiving all of the data that is being sent to the Kinesis stream. What could be the reason for this?

A. The sensors probably stopped working on somedays, hence data is not sent to the stream.

B. S3 can only store data for a day.

C. Data records are only accessible for a default of 24 hours from the time they areadded to a stream.

D. Kinesis streams are not meant to handle IoT related data.

Answer

C. Data records are only accessible for a default of 24 hours from the time they areadded to a stream.

Kinesis Streams support changes to the data record retention period of your stream. A Kinesis stream is an ordered sequence of data records meant to be written to and read from in real-time. Data records are therefore stored in shards in your stream temporarily. The time period from when a record is added to when it is no longer accessible is called theretention period. A Kinesis stream stores records from 24 hours by default, up to 168 hours. Option A, even though a possibility, cannot be taken for granted as the right option. Option B is invalid since S3 can store data indefinitely unless you have a lifecycle policy defined. Option D is invalid because the Kinesis service is perfect for this sort of data ingestion. For more information on Kinesis data retention, please refer to the below URL: http://docs.aws.amazon.com/streams/latest/dev/kinesis-extended-retention.html


5. You are deploying an application to track the GPS coordinates of delivery trucks in the United States. Coordinates are transmitted from each delivery truck once every three seconds. You need to design an architecture that will enable real-time processing of these coordinates from multiple consumers. Which service should you use to implement data ingestion?

A. Amazon Kinesis

B. AWS Data Pipeline

C. Amazon AppStream

D. Amazon Simple Queue Service

Answer

A. Amazon Kinesis

Kinesis Streams support changes to the data record retention period of your stream. A Kinesis stream is an ordered sequence of data records meant to be written to and read from in real-time. Data records are therefore stored in shards in your stream temporarily. The time period from when a record is added to when it is no longer accessible is called theretention period. A Kinesis stream stores records from 24 hours by default, up to 168 hours. Option A, even though a possibility, cannot be taken for granted as the right option. Option B is invalid since S3 can store data indefinitely unless you have a lifecycle policy defined. Option D is invalid because the Kinesis service is perfect for this sort of data ingestion. For more information on Kinesis data retention, please refer to the below URL: http://docs.aws.amazon.com/streams/latest/dev/kinesis-extended-retention.html


6. You currently work for a company that is specialised in baggage management. GPS devices installed on all the baggages, deliver the coordinates of the unit every 10 seconds. You need to process these coordinates in real-time from multiple sources. Which tool should you use to process the data?

A. Amazon EMR

B. Amazon SQS

C. AWS Data Pipeline

D. Amazon Kinesis

Answer

D. Amazon Kinesis

Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities to cost-effectively process streaming data at any scale, along with the flexibility to choose the tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-time data such as video, audio, application logs, website clickstreams, and IoT telemetry data for machine learning, analytics, and other applications. Amazon Kinesis enables you to process and analyze data as it arrives and respond instantly instead of having to wait until all your data is collected before the processing can begin. For more information on Amazon Kinesis, please visit the link below. https://aws.amazon.com/kinesis/


7. You require the ability to analyze a customer’s clickstream data on a website so they can do behavioral analysis. Your customer needs to know what sequence of pages and ads their customer clicked on. This data will be used in real time to modify the page layouts as customers click through the site to increase stickiness and advertising click-through. Which option meets the requirements for captioning and analyzing this data?

A. Log clicks in weblogs by URL store to Amazon S3, and then analyze with Elastic MapReduce.

B. Push web clicks by session to Amazon Kinesis and analyze behavior using Kinesis workers.

C. Write click events directly to Amazon Redshift and then analyze with SQL.

D. Publish web clicks by session to an Amazon SQS queue. Then send the events to AWS RDsfor further processing.

Answer

B. Push web clicks by session to Amazon Kinesis and analyze behavior using Kinesis workers. The AWS Documentation mentions the following

Amazon Kinesis Data Streams enables you to build custom applications that process or analyze streaming data for specialized needs. Kinesis Data Streams can continuously capture and store terabytes of data per hour from hundreds of thousands of sources such as website clickstreams, financial transactions, social media feeds, IT logs, and location-tracking events.

For more information on Amazon Kinesis, please visit the following URL:

https://aws.amazon.com/kinesis/data-streams/


8. Your current log analysis application takes more than four hours to generate a report of the top 10 users of your web application. You have been asked to implement a system that can report this information in real time, ensure that the report is always up to date, and handle increases in the number of requests to your web application. Choose the option that is cost-effective and can fulfill the requirements.

A. Publish your data to CloudWatch Logs, and configure your application to Auto Scale tohandle the load on demand.

B. Publish your log data to an Amazon S3 bucket. Use AWS CloudFormation to create an AutoScaling group to scale your post-processing application which is configured topull down your log files stored in Amazon S3.

C. Post your log data to an Amazon Kinesis data stream, and subscribe yourlog-processing application so that is configured to process your logging data.

D. Configure an Auto Scaling group to increase the size of your Amazon EMR cluster.

Answer

C. Post your log data to an Amazon Kinesis data stream, and subscribe yourlog-processing application so that is configured to process your logging data.

Amazon Kinesis makes it easy to collect, process, and analyze real-time, streaming data so you can get timely insights and react quickly to new information. Amazon Kinesis offers key capabilities to cost effectively process streaming data at any scale, along with the flexibility to choose the tools that best suit the requirements of your application. With Amazon Kinesis, you can ingest real-time data such as application logs, website clickstreams, IoT telemetry data, and more into your databases, data lakes and data warehouses, or build your own real-time applications using this data. Amazon Kinesis enables you to process and analyze data as it arrives and respond in real-time instead of having to wait until all your data is collected before the processing can begin. For more information on AWS Kinesis, please see the below link: https://aws.amazon.com/kinesis/