Parallel Hexagonal Cellular Automaton Framework by asegizek

Project Proposal

Summary

We are going to make a parallel cellular automaton framework for hexagonal grids that will run on CUDA. It will be able to efficiently run cellular automata with many states and any set of rules which determine state based on a cell’s surrounding neighborhood.

Background

Cellular automata are discrete models used to simulate various processes often inspired by nature. It usually consists of a grid of cells where each cell is in one of a finite number of states. For every time step the state of each cell is updated based on a predefined rule with respect to its neighboring cells. For example in the famous Game of Life each cell in the “live” state becomes “dead” if there are fewer than 2 live neighboring cells or more than 3 live cells. Whereas any dead cell can become live again if it is surrounded by exactly 3 live cells. These simple rules can create complex simulations. We want to extend the idea of cellular automata to something more challenging than just a square tiled grid: a hexagonally tiled grid. This different layout provides its own sets of challenges which we will cover in the next section.

Challenge

Creating a parallel cellular automaton for a normal cartesian grid has multiple challenges, such as determining how to split up the work, handling cells at the edge of a thread’s region, or determining if there are sections of the grid where no work needs to be done. Creating a cellular automaton for a hexagonal grid includes all these challenges in addition to many more. As all of the previous parallel programs we’ve worked on so far have operated on grids of squares, working with a hexagonal grid will require different techniques than the ones we’ve used previously. Mapping a hexagonal grid to memory isn’t a clear cut task, and finding the most efficient way to do this will be necessary. In addition to this, using CUDA will have its own set of challenges as we’ll need to find the most efficient way to allocate irregularly shaped groups of hexagons between blocks and threads. Finding the best approach to take for solving these problems should prove to be an interesting exercise.

Resources

We will be using the gates machines with the NVIDIA GPU’s for our project. We will code this project using C++ and we will not be using any starter code although we will probably use a third party library to render our simulations.

Goals

We plan to create a fast general cellular automata simulator for a hexagonal grid. We will allow an arbitrarily large number of possible cell states, and allow for a cell to change its state based on any rule which looks at the states of its neighboring cells. We are unsure of how fast we’d like our simulation to be, but it should at least be significantly faster than the serial implementation, and hopefully be on the same order of magnitude as the very advanced, professionally written cellular automaton simulator Golly. If things go better than expected and we are ahead of schedule we would hope to be able to extend the framework to allow each cell to update not only based on its neighboring cells but perhaps to cells farther away, granting the framework a lot more flexibility. However if we find ourselves behind schedule then we expect ourselves to specialize our framework towards one particular cellular automaton and achieve decent speedup there. For the demo we plan to showcase the speedup graphs of our parallel framework and hopefully the real time simulation of our framework in action while ssh’ed to a Gates machine. However if it turns out the connection is too slow we will pre-record a video showcasing the real time simulations of both the sequential and parallel versions of various cellular automata.

Platform

For this project we decided to use C++ because we already know how to use it with CUDA and because we’ll be able to make use of the features C++ adds to C.

Schedule

Week	What we plan to get done
Oct 31 - Nov 6	-Create a simple serial implementation -Create simple renderer and interface -Read up on more CUDA features -Plan our approach for the parallel implementation
Nov 7 - Nov 13	-Begin simple parallel implementation
Nov 14 - Nov 20	-Test parallel implementation to locate bottlenecks -Improve our algorithm based on tests -Start and finish checkpoint write-up
Nov 21 - Nov 27	-Continue improving algorithm -Find ways to optimize algorithm to ignore large regions of grid consisting only of “dead” cells
Nov 28 - Dec 4	-Improve flexibility of framework -Do more debugging and testing
Dec 5 - Dec 11	-(Optional) Add a gui interface for framework -Start collecting performance data for the final report -Start writing final report and prepare for presentation
Dec 12 - 13	-Present our project