- Direct Known Subclasses:
- IntegrationTestBigLinkedListWithVisibility, IntegrationTestReplication
public class IntegrationTestBigLinkedList
extends IntegrationTestBase
This is an integration test borrowed from goraci, written by Keith Turner,
which is in turn inspired by the Accumulo test called continuous ingest (ci).
The original source code can be found here:
https://github.com/keith-turner/goraci
https://github.com/enis/goraci/
Apache Accumulo [0] has a simple test suite that verifies that data is not
lost at scale. This test suite is called continuous ingest. This test runs
many ingest clients that continually create linked lists containing 25
million nodes. At some point the clients are stopped and a map reduce job is
run to ensure no linked list has a hole. A hole indicates data was lost.
The keys of the nodes in the linked list are random. This causes each linked
list to spread across the table. Therefore, if one part of the table loses
data, the loss will be detected by references in another part of the table.
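To make that concrete, here is a minimal sketch of how a single node could be
written, assuming an HBase layout where each node is a row under a random key
and a "prev" column holds the key of the node it references. The family and
qualifier names below are illustrative, not the test's actual constants.

  import java.io.IOException;
  import org.apache.hadoop.hbase.client.Put;
  import org.apache.hadoop.hbase.client.Table;
  import org.apache.hadoop.hbase.util.Bytes;

  class NodeWriteSketch {
    static final byte[] FAMILY = Bytes.toBytes("meta"); // illustrative family name
    static final byte[] PREV = Bytes.toBytes("prev");   // illustrative reference column

    // Row key = the node's random id; value = key of the (flushed) node it references.
    static void writeNode(Table table, byte[] key, byte[] prevKey) throws IOException {
      Put put = new Put(key);
      put.addColumn(FAMILY, PREV, prevKey);
      table.put(put);
    }
  }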
THE ANATOMY OF THE TEST
Below is a rough sketch of how data is written. For specific details look at
the Generator code.
1. Write out 1 million nodes
2. Flush the client
3. Write out 1 million nodes that reference the previous million
4. If this is the 25th set of 1 million nodes, update the 1st set of 1 million to point to the last
5. Goto 1
The key is that nodes only reference flushed nodes. Therefore a node should
never reference a missing node, even if the ingest client is killed at any
point in time.
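A compilable sketch of that loop is below, using hypothetical placeholder
helpers (writeSet, flush, updateReferences); the real Generator is a map-only
MapReduce job and its details differ.

  // Sketch only: the helpers are placeholders, not the test's actual code.
  public class GeneratorSketch {
    static final int SET_SIZE = 1_000_000;
    static final int SETS_PER_LOOP = 25;

    void run() {
      byte[][] first = null; // keys of the 1st set, kept to close the loop
      byte[][] prev = null;  // keys of the most recently flushed set
      for (int set = 0; ; set = (set + 1) % SETS_PER_LOOP) {
        byte[][] current = writeSet(prev); // steps 1 and 3: each node references a node in prev
        flush();                           // step 2: only flushed nodes may be referenced
        if (set == 0) first = current;
        if (set == SETS_PER_LOOP - 1) {
          updateReferences(first, current); // step 4: 1st set points to last, closing the loop
        }
        prev = current;                     // step 5: goto 1
      }
    }

    byte[][] writeSet(byte[][] prev) { return new byte[SET_SIZE][]; } // placeholder
    void flush() {}                                                   // placeholder
    void updateReferences(byte[][] from, byte[][] to) {}              // placeholder
  }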
When running this test suite with Accumulo, a script called the Agitator runs
in parallel, randomly and continuously killing server processes. Many data
loss bugs were found in Accumulo this way. This test suite can also help find
bugs that impact uptime and stability when run for days or weeks.
This test suite consists of the following:
- a few Java programs
- a little helper script to run the Java programs
- a Maven script to build it
When generating data, it's best to have each map task generate a multiple of
25 million nodes. The reason for this is that circular linked lists are
closed every 25M nodes. Generating a count that is not a multiple of 25M will
leave some nodes in the linked list without references; the loss of an
unreferenced node cannot be detected.
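As a concrete illustration, a hypothetical pre-flight check (not part of the
actual tool) could validate the per-mapper count like this:

  // Hypothetical check: each map task should emit whole 25M loops.
  static void checkNodeCount(long nodesPerMapper) {
    final long LOOP_SIZE = 25_000_000L;
    if (nodesPerMapper % LOOP_SIZE != 0) {
      // The trailing partial loop is never closed, so its most recently
      // written nodes stay unreferenced and their loss is undetectable.
      throw new IllegalArgumentException(
          (nodesPerMapper % LOOP_SIZE) + " trailing nodes would be unreferenced");
    }
  }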
Below is a description of the Java programs:
Generator - A map-only job that generates data. As stated previously, it's best to generate data
in multiples of 25M. An option is also available to allow concurrent walkers to select and walk
random flushed loops during this phase.
Verify - A map reduce job that looks for holes. Look at the counts after running. REFERENCED and
UNREFERENCED are OK; any UNDEFINED counts are bad. Do not run at the same time as the Generator.
(A sketch of the hole-detection logic appears after this list.)
Walker - A standalone program that starts following a linked list and emits timing info.
Print - A standalone program that prints nodes in the linked list.
Delete - A standalone program that deletes a single node.
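The reducer below is only a sketch of Verify's classification step, assuming
a mapper that emits, for every row scanned, (nodeKey, DEF) and (prevKey, REF);
the real job defines its own mapper, writable types, and counters.

  import java.io.IOException;
  import org.apache.hadoop.io.ByteWritable;
  import org.apache.hadoop.io.BytesWritable;
  import org.apache.hadoop.io.NullWritable;
  import org.apache.hadoop.mapreduce.Reducer;

  // Sketch only: DEF = 1 (the node row exists), REF = 2 (some node points here).
  public class VerifySketchReducer
      extends Reducer<BytesWritable, ByteWritable, NullWritable, NullWritable> {
    enum Counts { REFERENCED, UNREFERENCED, UNDEFINED }

    @Override
    protected void reduce(BytesWritable key, Iterable<ByteWritable> values,
        Context context) throws IOException, InterruptedException {
      boolean defined = false, referenced = false;
      for (ByteWritable v : values) {
        if (v.get() == 1) defined = true;
        else referenced = true;
      }
      if (defined && referenced) {
        context.getCounter(Counts.REFERENCED).increment(1);
      } else if (defined) {
        // Nothing points at this node, e.g. the open tail of an unclosed loop.
        context.getCounter(Counts.UNREFERENCED).increment(1);
      } else {
        // A reference to a node that does not exist: a hole, data was lost.
        context.getCounter(Counts.UNDEFINED).increment(1);
      }
    }
  }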
This class can be run as a unit test, as an integration test, or from the command line, e.g.:
./hbase org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList loop 2 1 100000 /temp 1 1000 50 1 0