Bay Area In-Memory Computing Meetup
Join us June 24 in Menlo Park for the next Bay Area In-Memory Computing Meetup! We'll have valuable talks, a delicious BBQ dinner and more. RSVP now to reserve your spot.
> Talk 1: "Feeding data to the Kubernetes beast: bringing data locality to your containerized big data workloads." With Bin Fan, founding engineer of Alluxio, Inc. and PMC member of Alluxio open source project.
> Talk 2: "Best Practices for Native Persistence and Data Recovery." With Valentin (Val) Kulichenko, lead architect at GridGain Systems.
>> Talk 1 (Val): As an in-memory computing platform, GridGain and Apache Ignite support native persistence that stores data and indexes transparently on non-volatile memory, SSD or disk. When persistence is enabled, memory becomes a cache for the most frequently used data and indexes. Native persistence is ACID-compliant, durable and enables immediate availability on a restart of each node. Data is never lost; GridGain supports full and incremental snapshots along with continuous archiving, and provides Point-in-Time recovery to an individual transaction.
Val will share insights into the underlying architecture and best practices for implementing native persistence in production. His talk will cover:
- An architectural overview of native persistence, and centralized backup and recovery
- Tips and tricks for for configuring and managing persistence
- Best practices for checkpointing, using the Write-Ahead Log (WAL) and restoring from a failure
- Performance tuning recommendations to balance durability and performance, including how to create snapshots under load
>> Talk 2 (Bin): The latest advances in container orchestration by Kubernetes bring cost savings and flexibility to compute workloads in public/hybrid cloud / multi cloud environment. Such architectures typically lead to physically separated compute and storage services, where S3, Azure Data Lake or Google Cloud Storage are commonly used to provide data persistence.
Open source Alluxio approaches this problem in a new way. It helps elastic compute workloads realize the true benefits of the cloud, while bringing data locality and data accessibility to workloads orchestrated by Kubernetes. Alluxio can orchestrate data locality from any persistent storage including object store such as Ceph and cloud storage such as AWS S3 or GCS and make it accessible to compute running in Kubernetes pods. As a stateless data access layer (as opposed to a long-running data storage daemon service), Alluxio runs as a native service making data-intensive compute workloads Kubernetes friendly.
Bin will explain how this new approach of bringing data locality to data-intensive compute workloads in Kubernetes environments. He'll also cover the Alluxio, architecture, deployment with Kubernetes and real-world production use cases.
This is a free event but you must RSVP to attend.