Streaming-OODT: An Open-Source Platform for Big-Data Processing

Streaming-OODT was originally conceived to overcome the limitations of traditional big-data processing and management systems. It is based on an open source data processing framework called OODT (Object Oriented Data Technology) and was funded by NASA Jet Propulsion Laboratory's Big Data Research & Technology Development initiative "Archiving, Processing and Dissemination for the Big Data Era". The vision behind the project is to combine state-of-the-art technologies into an easy-to-use big-data processing system, prepackaged so that users can quickly process big data without the need to patch together individual technologies.

Streaming-OODT provides both traditional batch processing and in-memory MapReduce processing for use on general computing clusters. Cluster management and multi-tenancy are provided via Apache Mesos, which manages batch processing as well as Streaming-OODT's underlying technologies. This ensures that multi-tenancy is applied to both the system and the user's processing.

Apache Spark provides in-memory MapReduce processing, enabling processing at speeds hundreds of times faster than Hadoop MapReduce. This system is augmented by Apache Kafka, which is used to manage streaming data. Together they let the user process streaming data alongside traditional data in Apache Spark and thus tackle data sets too large to persist en masse to disk, without losing the ability to process data sets that already exist on disk.

Tachyon, an in-memory distributed file system, provides lightning-fast distributed access to data files and streams on all nodes of the cluster. Persistence is provided by the Hadoop Distributed File System (HDFS), giving the user both fast data access and persistence to disk.

The purpose of this talk is to demonstrate Streaming-OODT, which will enable the audience to use Streaming-OODT and supporting technologies to quickly tackle their own big-data problems. The talk will introduce Streaming-OODT, show how to quickly install and configure the system, explain the value added by the underlying technologies, and walk through a working example of big-data processing. Finally, benchmarks will be presented so that the audience can see the benefit of these technologies and their combination.

Speaker: Michael Starch

Meetup Event Page


Red Hat and OpenShift

Come to this meetup to understand more about Red Hat and OpenShift, Red Hat's multi-language platform-as-a-service solution.

About Red Hat:

Red Hat has grown from a single Linux distribution into a 20-year-old, 1.5+ billion USD company with a portfolio of products in operating systems, virtualization, storage, middleware, and cloud. We'll give a general overview of the company and talk about where it is headed in the future.

About Red Hat OpenShift:

OpenShift is Red Hat's award-winning Platform-as-a-Service (PaaS), the #1 leader on Gartner's Magic Quadrant for PaaS, which allows developers to quickly develop, host, and scale applications in a cloud environment.

Speaker:

Albert T. Wong is a solution architect at Red Hat. He is a jack of all trades in Java EE, Red Hat and IBM software, and open source, with a deep background in eCommerce and in building, deploying, and scaling very large websites. Albert is a UC Irvine Information and Computer Science alum.

Meetup Event Page

Presentation Material:


AROS Research Operating System

AROS is an open source re-implementation of AmigaOS 3.1. This has been a very significant year for the project, with the Directory Opus 5 bounty, the ABI V1 on-trunk merge, and the release of Icaros v2, all of which will be covered in the talk.

Simone Bernacchia is a web developer with a background as a graphic artist and musician on a couple of Amiga games. He also has a blog where he writes about retro Amiga games, ProTracker music composing, and AROS.

Speaker: Simone Bernacchia

Meetup Event Page

Presentation Material: PDF


Amateur High Altitude Ballooning

This talk will cover the stories and engineering challenges faced by the Null Space Labs High Altitude Ballooning team and how they designed, analyzed, and flew hardware and software to survive the harsh conditions of near space. Recent flights include CUBEX, a payload capable of live image transmission, solar charging, power management, and position reporting, as well as the HABEXpico flight, which flew halfway across the United States.

Presenter: Arko

Meetup Event Page


Links mentioned in the talk:


Git and GitHub Tips and Tricks

Git is a popular distributed version control system, and GitHub is a platform for Git users to share their code and projects with others, among other things. Lan will share tips and tricks for Git and GitHub that will be useful to users of all levels. Feel free to come and share your own tips.

Presenter: Lan Dang

Meetup Event Page


PostgreSQL Authentication Mechanisms

PostgreSQL is the only open source database that supports identifying users with either of the non-password standards, and it can do so on both Windows and Unix platforms. Henry Hotz will illustrate how to make use of each and discuss the security implications of using better authentication mechanisms.

Presenter: Henry Hotz