Data Unpacked 001 – Defining a Data Platform

Chris EvansData Management, Data Unpacked Podcast, Guest Speakers

In the first of a new series of podcasts, Chris Evans is in discussion with Colin Gallagher (VP of Product Marketing) and Joel Kaufman (Senior Technical Marketing Manager), both with data management start-up Weka. Weka recently announced Weka 4, a major upgrade to the technology offered by the company. At the same time, the distributed file system that was WekaFS is now being marketed as a “data platform”. We wanted to get into the details of how a file system and a platform can be defined and how specific features and functionality separate the two.

The conversation takes us through understanding the Weka technology, looking at aspects such as data variety, user experience, data mobility, data efficiency and dynamic scalability that all form part of the requirements of a platform.

For more details on Weka, check out the company website at https://www.weka.io/ or our blog content at https://www.architecting.it/blog/category/vendors/weka/.

Timeline

  • 00:00:00 – Intros
  • 00:02:00 – Weka 4 introduces the term “data platform”
  • 00:04:10 – What are the key features of the Weka platform?
  • 00:06:20 – Weka has already demonstrated performance credentials
  • 00:07:11 – What makes a data platform more than a file system?
  • 00:08:52 – Weka rebuilds on modern technology
  • 00:10:46 – It’s no longer necessary to deploy multiple platforms for multiple use cases
  • 00:12:43 – Don’t tell me that data is the new oil
  • 00:14:00 – Data is processed in pipelines, rather than in batch
  • 00:15:21 – Petabyte scale is no longer “big”
  • 00:18:28 – The I/O blender is back for big data
  • 00:20:12 – How do platforms solve the data growth problem?
  • 00:21:56 – New NVMe drives are becoming cost effective in combination with object stores
  • 00:23:41 – A Data Platform should automatically manage the data placement & efficiency issues
  • 00:25:54 – Data should be mobile and available when required
  • 00:26:52 – How does the data platform change the user experience?
  • 00:28:25 – The public cloud has driven the simplicity of data management
  • 00:30:50 – Data Platforms should be flexible and dynamic
  • 00:32:30 – Customers want longevity and ephemeral storage platforms
  • 00:36:00 – 23 & Me is a good example of a seasonal demand requirement
  • 00:38:04 – Demand scaling can be for workflow process, like Virtual Effects companies
  • 00:40:11 – Data is alive and has flow, movement and changeability
  • 00:42:12 – Wrap Up

Copyright (c) 2022 Unpacked Network. Post #6c26. Do not reproduce without permission, in part, or whole.