We’re laser-focused on scaling your AWS environment, regardless of the delivery model: an end-to-end solution, advisory, or consulting. Our work ranges from automating key processes and implementing our accelerators and frameworks to delivering entire platforms.
proSkale’s data ingestion process is faster and less expensive than traditional approaches. It is an ideal choice for data migration from on-premises to the cloud, especially between relational databases. The process is built on automation and is event-driven.
proSkale’s trained staff provides a turnkey solution that also includes training and post-implementation support.
proSkale can create customized serverless solutions to fit customer needs.
proSkale engineers thoroughly analyze the customer’s on-premises infrastructure before designing, developing, and implementing a serverless process.
Why should you choose AWS?
Data processing needs are constantly evolving, and the demand for faster intake and analytics of structured, semi-structured, and unstructured data grows every day. A serverless data lake on AWS does not require you to configure servers, databases, or any other hardware or software. AWS provides fully managed services that are charged on a ‘pay-as-you-use’ basis.
No upfront hardware or software configuration required
24/7 availability
Auto-scaling and event-driven
Requires minimal maintenance; users can focus on delivering business objectives instead of getting entangled in day-to-day operational activities.
proSkale has developed its own fully automated, event-driven data migration process that can extract and load data from all popular relational databases (Teradata, Oracle, SQL Server, etc.) to Amazon S3, Amazon Redshift, other relational databases, and Snowflake.
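For illustration only, here is a minimal sketch of this extract-and-load pattern; it is not proSkale’s actual tool. It assumes a relational source read with pandas and SQLAlchemy, Parquet written via pyarrow, and placeholder connection, table, and bucket names.

```python
# Illustrative extract-and-load sketch (hypothetical names; not proSkale's tool).
import io

import boto3
import pandas as pd
import sqlalchemy

# Placeholder connection string and object names -- replace with real values.
SOURCE_URI = "mssql+pyodbc://user:password@source-host/sales_db?driver=ODBC+Driver+17+for+SQL+Server"
TARGET_BUCKET = "my-data-lake-raw"   # hypothetical S3 landing bucket
TABLE_NAME = "orders"                # hypothetical source table

engine = sqlalchemy.create_engine(SOURCE_URI)
s3 = boto3.client("s3")

# Extract the source table in chunks so large tables do not exhaust memory,
# and land each chunk in S3 as a Parquet part file.
for i, chunk in enumerate(pd.read_sql_table(TABLE_NAME, engine, chunksize=100_000)):
    buffer = io.BytesIO()
    chunk.to_parquet(buffer, index=False)   # requires pyarrow or fastparquet
    s3.put_object(
        Bucket=TARGET_BUCKET,
        Key=f"raw/{TABLE_NAME}/part-{i:05d}.parquet",
        Body=buffer.getvalue(),
    )
```

In an event-driven setup, a job like this would typically be kicked off by a scheduler or a change-data-capture trigger rather than run by hand.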
Whether it’s surgically fitting into one narrow need or spanning your entire data lake architecture, we’re here to empower your team in its cloud journey.
The solution supports both batch and real-time data processing.
This architecture includes proSkale’s own event-driven data migration process, which can be customized. It is a proven process that saves data migration cost and time.
proSkale’s Serverless Data Lake solution on AWS supports batch, real-time, and near-real-time data ingestion, transformation, and analytics.
Batch Mode
A variety of tools are available to put source data into Amazon S3. proSkale has its own data migration tool to migrate relational databases and batch files to Amazon S3, relational databases, Amazon Redshift, and Snowflake.
Once data lands in S3, an AWS Lambda function is triggered through Amazon CloudWatch (see the sketch after this list).
The Lambda function starts an AWS Glue Crawler that determines the schema of the source file.
The Glue Crawler updates the Glue Data Catalog database with the file’s metadata.
A second Lambda function is started once the Glue Crawler job completes. It validates the source data, converts it to Parquet format, and writes it to another S3 bucket called ‘stage’.
A Glue ETL job is started once the data is fully loaded into the S3 stage bucket.
The Glue ETL job transforms the data and moves the transformed data to the S3 processed bucket for consumption.
The Glue ETL job sends an email notification through Amazon SNS (not shown in the diagram above).
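A minimal sketch of the ingest trigger described above is shown below, assuming a Lambda function wired to S3 object-created events (for example, via CloudWatch/EventBridge); the crawler name is a placeholder.

```python
# Illustrative Lambda handler for the ingest trigger (hypothetical names).
import boto3

glue = boto3.client("glue")

CRAWLER_NAME = "raw-zone-crawler"   # hypothetical Glue Crawler configured on the raw bucket

def lambda_handler(event, context):
    # The S3 object-created event identifies the file that just landed.
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        key = record["s3"]["object"]["key"]
        print(f"New object s3://{bucket}/{key}; starting crawler {CRAWLER_NAME}")

    # Start the crawler so the Glue Data Catalog picks up the new file's schema.
    try:
        glue.start_crawler(Name=CRAWLER_NAME)
    except glue.exceptions.CrawlerRunningException:
        # The crawler is already running; the new file will be cataloged on its next pass.
        pass

    return {"status": "crawler started"}
```

The downstream steps (Parquet conversion, the Glue ETL job, and the SNS notification) follow the same pattern of one event triggering the next stage.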
Speed Mode
Real-time data is processed through Amazon Kinesis Data Streams (a producer sketch follows this list).
The stream feeds Amazon Kinesis Data Analytics for data transformation.
Kinesis Data Analytics writes the processed data to S3 through Amazon Kinesis Data Firehose (not shown in the diagram above).
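As a concrete example of the speed layer’s entry point, here is a minimal producer sketch, assuming a hypothetical stream name and event shape; it is not tied to any particular source system.

```python
# Illustrative Kinesis producer sketch (hypothetical stream name and event shape).
import json
import time

import boto3

kinesis = boto3.client("kinesis")
STREAM_NAME = "clickstream-events"   # hypothetical Kinesis Data Stream

def publish_event(event: dict) -> None:
    # Each record is routed to a shard by its partition key.
    kinesis.put_record(
        StreamName=STREAM_NAME,
        Data=json.dumps(event).encode("utf-8"),
        PartitionKey=str(event.get("user_id", "anonymous")),
    )

if __name__ == "__main__":
    publish_event({"user_id": 42, "action": "page_view", "ts": time.time()})
```

Downstream, Kinesis Data Analytics and Kinesis Data Firehose consume the stream and deliver the transformed records to S3, as described above.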
Benefits
Pros
No need to worry about purchasing, provisioning, and managing backend servers
Lower cost: the ‘pay-as-you-go’ model charges only for the services used and for how long they run, reducing resource and labor costs
Scales automatically
No maintenance needed
Faster to deploy: no need to migrate code to servers
Faster time-to-market
Event-driven automated process
Well suited for processing streaming data
Provides built-in high availability
Cons
Not suitable for long-running processes
Greater reliance on the cloud provider, since the provider manages the infrastructure
Less control over configuration, performance tuning, security, and support
After a thorough analysis of the existing on-premises applications, proSkale’s engineers design a serverless system and estimate its cost and cycle time. These metrics are used to determine ROI. proSkale’s solution generally saves 40% to 50% over on-premises cost.
proSkale can run a proof of concept (POC) to demonstrate cost and time savings. Based on its results, the customer can decide whether to move forward with proSkale’s solution.
Benefits for customers include lower operational cost, no infrastructure cost, very low maintenance cost, faster time-to-market, and the ability to scale the environment in a very short period; AWS provides automatic scaling. proSkale ensures that its solution meets regulatory compliance requirements and provides security through AWS’s proven security tools such as Identity and Access Management (IAM), encryption, role-based access control, and multi-factor authentication. The solution captures metadata for the ingested data, preventing the data lake from becoming a ‘data swamp’.
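As one concrete example of these AWS-native controls, default server-side encryption can be enforced on a data lake bucket; the bucket name and KMS key alias below are placeholders.

```python
# Illustrative sketch: enforce default SSE-KMS encryption on a lake bucket (hypothetical names).
import boto3

s3 = boto3.client("s3")

BUCKET = "my-data-lake-processed"   # hypothetical bucket
KMS_KEY_ID = "alias/data-lake-key"  # hypothetical customer-managed KMS key

s3.put_bucket_encryption(
    Bucket=BUCKET,
    ServerSideEncryptionConfiguration={
        "Rules": [
            {
                "ApplyServerSideEncryptionByDefault": {
                    "SSEAlgorithm": "aws:kms",
                    "KMSMasterKeyID": KMS_KEY_ID,
                },
                "BucketKeyEnabled": True,
            }
        ]
    },
)
```

IAM policies, role-based access, and MFA can be managed through the same API-driven approach.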
A serverless data lake architecture’s primary advantage is the ability to store objects in a highly durable, secure, and scalable manner with only milliseconds of latency for data access. A serverless solution can load any type of data – from websites, business applications, and mobile apps to IoT sensors.
Serverless architectures offer greater automation and scalability, more flexibility, and quicker time to market, all at a reduced cost.
Compared to other cloud-based data lake solutions, a serverless data lake lowers latency across the data life cycle; data is processed within milliseconds of being generated.
A serverless data lake solution can also handle small, medium, and large files: large files are split into smaller ones, and each is processed in a separate serverless job stream running in parallel, as sketched below.
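A minimal sketch of that fan-out pattern, assuming a hypothetical worker Lambda and splitting by byte ranges (the worker would still need to handle rows that straddle a range boundary):

```python
# Illustrative fan-out sketch (hypothetical names): split a large file already in S3
# into byte ranges and invoke a worker Lambda for each range in parallel.
import json

import boto3

s3 = boto3.client("s3")
lam = boto3.client("lambda")

BUCKET = "my-data-lake-raw"              # hypothetical bucket
KEY = "raw/orders/large_extract.csv"     # hypothetical large file
WORKER_FUNCTION = "process-file-chunk"   # hypothetical worker Lambda
CHUNK_BYTES = 128 * 1024 * 1024          # 128 MB per chunk

size = s3.head_object(Bucket=BUCKET, Key=KEY)["ContentLength"]

# Asynchronously invoke one worker per byte range; the workers run concurrently.
for start in range(0, size, CHUNK_BYTES):
    end = min(start + CHUNK_BYTES - 1, size - 1)
    lam.invoke(
        FunctionName=WORKER_FUNCTION,
        InvocationType="Event",   # fire-and-forget, so chunks are processed in parallel
        Payload=json.dumps(
            {"bucket": BUCKET, "key": KEY, "range": f"bytes={start}-{end}"}
        ).encode("utf-8"),
    )
```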
A serverless data lake provides very low latency for data processing. Data may take only a few seconds to go from generation to consumption, much less than with an IaaS framework.
Infrastructure setup takes time and requires operational support, which a serverless solution bypasses for faster time-to-market.
With an IaaS solution, the customer pays for the whole infrastructure, in contrast to the much less costly ‘pay-as-you-go’ serverless model.
proSkale’s proven approach to automating the data processing lifecycle enables millisecond latency across the whole lifecycle, from capture to consumption.
More automation with less manual intervention enables self-service for data scientists and analysts. It also saves the time that would otherwise be spent on operational support for non-serverless (IaaS) data processing solutions.
It enables faster time-to-market and fits well for transaction fraud detection, customer insight gathering, IoT data processing, and near-real-time data acquisition and transformation, in addition to batch-mode data ingestion and processing.
proSkale’s Data Migration Tool – proSkale has developed a data migration tool that performs schema conversion between disparate source and target relational databases. The process is parameter-driven and easy to customize, and it is used for both bulk and incremental loads.
Amazon S3 – Data lake storage. It can store almost any data format: structured, unstructured, and semi-structured.
AWS Lambda – A serverless compute service that executes user-written functions in Python, Java, and several other programming languages.
AWS Glue Crawler – Glue Crawlers determine the metadata of the source data, update the Glue Data Catalog with newly discovered metadata, and can recognize a variety of file formats such as CSV, JSON, and Parquet.
AWS Glue ETL – Used to transform input data into the desired output, converting data formats in the process.
Amazon Athena – Used for executing SQL queries on structured and semi-structured data on S3.
Microservices using Lambda functions – Used to support API calls. APIs trigger a pre-configured Lambda function to query and update data on S3 (a sketch follows below).
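A minimal sketch of such a Lambda-backed API call is shown below, assuming the query runs through Amazon Athena against a Glue Data Catalog table; the database, table, and result-location names are placeholders.

```python
# Illustrative API-backed Lambda sketch (hypothetical names): run an Athena query
# over the processed zone and return the query execution id to the caller.
import json

import boto3

athena = boto3.client("athena")

DATABASE = "datalake_processed"               # hypothetical Glue Data Catalog database
OUTPUT_LOCATION = "s3://my-athena-results/"   # hypothetical result bucket

def lambda_handler(event, context):
    # API Gateway passes query parameters through the event; default to a simple count.
    params = event.get("queryStringParameters") or {}
    table = params.get("table", "orders")   # NOTE: validate/whitelist the table name in real use

    response = athena.start_query_execution(
        QueryString=f"SELECT COUNT(*) AS row_count FROM {table}",
        QueryExecutionContext={"Database": DATABASE},
        ResultConfiguration={"OutputLocation": OUTPUT_LOCATION},
    )
    return {
        "statusCode": 200,
        "body": json.dumps({"queryExecutionId": response["QueryExecutionId"]}),
    }
```

A real service would poll get_query_execution for completion and fetch rows with get_query_results before responding; the sketch stops at submitting the query.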