The problem
Input: large, irregular data graph
Output: index structure for evaluating
regular path expressions