root/owl/trunk/tools/DataDistributer.java
(Current path doesn't exist after revision 950)
Revision 110
Modified Tue Sep 19 13:04:55 2006 UTC (18 years ago) by duarte
File length: 22855 byte(s)
Diff to previous 104
New class MySQLConnectionCheck, which uses threads to check whether MySQL servers are alive
Changes in DataDistribution:
- new field: String[] nodes, initialised in constructor
- changed constructor to check for nodes that are alive, using new method nodesAlive
- renamed getNodes to getMySQLNodes. New method getNodes just returns the new field String[] nodes
DataDistributer: renamed calls to DataDistribution's getNodes to getMySQLNodes
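A threaded liveness check of the kind described for revision 110 could be sketched as follows. This is not the actual MySQLConnectionCheck code: the class name NodeAliveSketch, the signature of nodesAlive, and the injected probe are assumptions made for illustration. A real check would try to open a MySQL connection where this sketch calls the predicate.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical sketch: one thread per node, each recording whether its probe succeeded.
class NodeAliveSketch {

    // Returns the nodes for which the probe reported "alive", in input order.
    // The probe is an assumption standing in for a real MySQL connection attempt.
    static String[] nodesAlive(String[] nodes, Predicate<String> probe) {
        boolean[] alive = new boolean[nodes.length];
        Thread[] checks = new Thread[nodes.length];
        for (int i = 0; i < nodes.length; i++) {
            final int idx = i;
            checks[i] = new Thread(() -> {
                try {
                    alive[idx] = probe.test(nodes[idx]);
                } catch (RuntimeException e) {
                    alive[idx] = false; // a failing probe counts as a dead node
                }
            });
            checks[i].start();
        }
        List<String> result = new ArrayList<>();
        for (int i = 0; i < nodes.length; i++) {
            try {
                checks[i].join(); // join makes the thread's write to alive[i] visible
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                break; // stop waiting; nodes not yet confirmed are left out
            }
            if (alive[i]) {
                result.add(nodes[i]);
            }
        }
        return result.toArray(new String[0]);
    }
}
```

Running the probes in separate threads means one unresponsive server delays the check by roughly a single timeout rather than one timeout per node.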
Revision 104
Modified Tue Aug 8 11:10:01 2006 UTC (18 years, 1 month ago) by duarte
File length: 22825 byte(s)
Diff to previous 102
Modified finalizeDirs() to be tri-state: prompt, force or noremove. The default is force (using new field rmvtmp).
setDumpDir now gets rid of the temp dir created in the constructor; otherwise an empty temp dir was left lying around
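The tri-state behaviour can be illustrated with a small sketch. The enum RemoveMode, the method shouldRemove, and the askUser callback are hypothetical names; the log only says that a field rmvtmp selects between prompt, force and noremove, and does not say what type that field has.

```java
import java.io.File;
import java.util.function.Predicate;

// Hypothetical sketch of a tri-state cleanup decision (prompt / force / noremove).
class FinalizeDirsSketch {

    enum RemoveMode { PROMPT, FORCE, NOREMOVE }

    // Decides whether a temp dir should be deleted under the given mode.
    // askUser stands in for an interactive prompt and is only consulted in PROMPT mode.
    static boolean shouldRemove(RemoveMode mode, File dir, Predicate<File> askUser) {
        switch (mode) {
            case FORCE:
                return true;
            case NOREMOVE:
                return false;
            case PROMPT:
                return askUser.test(dir);
            default:
                throw new IllegalArgumentException("Unknown mode: " + mode);
        }
    }
}
```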

Revision 102
Modified Fri Aug 4 14:34:00 2006 UTC (18 years, 1 month ago) by duarte
File length: 22303 byte(s)
Diff to previous 96
Method finalizeDirs now prompts by default whether to delete temp dirs, unless new boolean field "force" is true.
Revision 96
Modified Wed May 31 11:02:12 2006 UTC (18 years, 4 months ago) by duarte
File length: 21318 byte(s)
Diff to previous 94
Slight improvement in the insertIdsToKeyMaster method. Instead of querying for the node id in each insert, we now get the mapping from node names to node ids into a HashMap through a single separate query.
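The idea of fetching the mapping once and then looking ids up in memory can be sketched like this. The class and method names are assumptions, and the two arrays stand in for the result of the single mapping query; the real table and column names are not given in the log.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: build the node name -> node id map once, then reuse it per insert.
class NodeIdLookupSketch {

    // names[i] and ids[i] represent one row of the (assumed) mapping query result.
    static Map<String, Integer> buildNodeIdMap(String[] names, int[] ids) {
        if (names.length != ids.length) {
            throw new IllegalArgumentException("names and ids must have the same length");
        }
        Map<String, Integer> nodeIds = new HashMap<>();
        for (int i = 0; i < names.length; i++) {
            nodeIds.put(names[i], ids[i]);
        }
        return nodeIds;
    }
}
```

Each insert then costs a map lookup instead of an extra round trip to the database.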
Revision 94
Modified Mon May 29 15:22:13 2006 UTC (18 years, 4 months ago) by duarte
File length: 21039 byte(s)
Diff to previous 93
Simplified the code for dealing with text/numerical keys.
There are no longer duplicate Integer/String methods; instead the methods use a generic Object[] that stores either a String[] or an Integer[] depending on the case.
This required modifying a few methods in DataDistributer, DataDistribution and MySQLConnection.
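As an illustration of a single method serving both key types through Object[], here is a minimal sketch. The class KeyArraySketch and the method toSqlList are hypothetical and do not appear in the log; the quoting is deliberately simple, and prepared statements would be the safer choice in real code.

```java
// Hypothetical sketch: one method that accepts either Integer[] or String[] as Object[].
class KeyArraySketch {

    // Renders ids as a comma-separated SQL list, quoting only the non-numeric ones.
    static String toSqlList(Object[] ids) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < ids.length; i++) {
            if (i > 0) {
                sb.append(",");
            }
            Object id = ids[i];
            if (id instanceof Integer) {
                sb.append(id); // numeric key: no quotes
            } else {
                // text key: wrap in single quotes and double any embedded quote
                sb.append("'").append(id.toString().replace("'", "''")).append("'");
            }
        }
        return sb.toString();
    }
}
```

Because Java arrays are covariant, both an Integer[] and a String[] can be passed where Object[] is expected, which is what removes the need for two near-identical methods.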
Revision 93
Modified Wed May 24 15:44:25 2006 UTC (18 years, 4 months ago) by duarte
File length: 23810 byte(s)
Diff to previous 89
Splitting of data now also works with text-based keys, as well as numerical ones
MySQLConnection:
- method getAllIds4KeyAndTable now split into two methods, one for numerical ids and another for text ids
- new methods getColumnType and isKeyNumerical
DataDistribution:
- method getIdSetsFromNodes split into two, one for numerical ids and one for text ids
DataDistributer:
- splitIdsIntoSets now split into two methods, one numerical and one text
- changed methods: splitTableToCluster, splitTable, insertIdsToKeyMaster, removePK, addPK, createNewKeyMasterTbl, removeZeros, loadSplitData, dumpSplitData so that they work for both text and numeric keys. Introduced generic type T in some of them
- some bugs corrected:
-- an important one in createNewKeyMasterTbl, which was inserting the record in dbs_keys with srcDb instead of destDb as it should have been
-- some bugs in loadSplitData and dumpSplitData for cases in which there are fewer ids than nodes, so that some nodes don't get any data. This case was not handled before.
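The edge case of having fewer ids than nodes can be made concrete with a generic splitting sketch. The even, contiguous split used here is an assumption; the log does not describe how splitIdsIntoSets actually divides the ids, and the class name SplitIdsSketch is hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: split ids into one set per node, allowing empty sets.
class SplitIdsSketch {

    // Returns exactly numNodes lists. When there are fewer ids than nodes,
    // the trailing lists are empty, so callers must skip nodes without data.
    static <T> List<List<T>> splitIdsIntoSets(List<T> ids, int numNodes) {
        if (numNodes <= 0) {
            throw new IllegalArgumentException("numNodes must be positive");
        }
        List<List<T>> sets = new ArrayList<>();
        int base = ids.size() / numNodes;   // minimum ids per node
        int extra = ids.size() % numNodes;  // first 'extra' nodes get one more id
        int start = 0;
        for (int n = 0; n < numNodes; n++) {
            int size = base + (n < extra ? 1 : 0);
            sets.add(new ArrayList<>(ids.subList(start, start + size)));
            start += size;
        }
        return sets;
    }
}
```

With two ids and four nodes, this yields two single-element sets followed by two empty ones, which is the situation the load and dump methods need to tolerate.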
Revision 89
Modified Fri May 5 10:46:35 2006 UTC (18 years, 4 months ago) by duarte
File length: 18523 byte(s)
Diff to previous 87
Added method to setDumpDir method
Revision 87
Modified Wed May 3 08:50:26 2006 UTC (18 years, 5 months ago) by duarte
File length: 18453 byte(s)
Diff to previous 86
Considerably improved the splitTableToCluster method:
- got rid of the unnecessary step of creating partial tables before dumping.
- now directly dumping with new method dumpSplitData, a modified dumpData that dumps using a WHERE condition
- added variable NUM_CONCURRENT_SAMEHOST_WRITE_QUERIES used in dumpSplitData method. It sets the concurrency when dumping locally only from the master
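Dumping a slice of a table directly, without first creating a partial table, might use a query like the one built below. This is only a sketch: the method dumpQuery, its parameters, and the use of an id range as the WHERE condition are assumptions, since the log does not show the query that dumpSplitData issues. SELECT ... INTO OUTFILE itself is standard MySQL syntax.

```java
// Hypothetical sketch: build a MySQL dump query restricted by a WHERE condition.
class DumpQuerySketch {

    // Table, key and file names are concatenated as-is for brevity;
    // real code would need to validate or escape them.
    static String dumpQuery(String table, String key, int minId, int maxId, String outFile) {
        return "SELECT * INTO OUTFILE '" + outFile + "' FROM " + table
                + " WHERE " + key + " >= " + minId
                + " AND " + key + " <= " + maxId;
    }
}
```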
Revision 86
Modified Fri Apr 28 10:31:35 2006 UTC (18 years, 5 months ago) by duarte
File length: 17940 byte(s)
Diff to previous 85
Added PARALLELISM in load/dump of tables using new class QueryThread (extends Thread)
Modified methods loadData, dumpData and loadSplitData to dump/load in parallel, in the cases where that is useful, by using the QueryThread class.
New method initializeDirs(String[]) to do some of the dir initialization that was in dumpData
Got rid of one of the getConnectionToNode method, not needed anymore
Two important new final static variables: NUM_CONCURRENT_READ_QUERIES and NUM_CONCURRENT_WRITE_QUERIES. They define how much concurrency we want in reads/writes to NFS for loads/dumps
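Capping the number of simultaneous load/dump queries can be sketched as below. Note the swap: the revision uses a QueryThread class that extends Thread, whereas this sketch uses a fixed-size thread pool from java.util.concurrent to get the same bound. The class name, the method runBounded, and the value given to NUM_CONCURRENT_WRITE_QUERIES are assumptions.

```java
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch: run queries with a fixed upper bound on concurrency.
class BoundedQueriesSketch {

    static final int NUM_CONCURRENT_WRITE_QUERIES = 3; // illustrative value, not from the source

    // Runs every task with at most 'limit' in flight at once and
    // returns how many tasks finished running (successfully or not).
    static int runBounded(List<Runnable> queries, int limit) {
        ExecutorService pool = Executors.newFixedThreadPool(limit);
        AtomicInteger finished = new AtomicInteger();
        for (Runnable query : queries) {
            pool.submit(() -> {
                try {
                    query.run();
                } finally {
                    finished.incrementAndGet();
                }
            });
        }
        pool.shutdown(); // accept no new tasks; queued ones still run
        try {
            pool.awaitTermination(1, TimeUnit.HOURS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return finished.get();
    }
}
```

A caller would pass NUM_CONCURRENT_WRITE_QUERIES as the limit for dumps and a separate read limit for loads, so that the shared file system is not hit by every node at once.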
Revision 85
Added Mon Apr 24 12:41:27 2006 UTC (18 years, 5 months ago) by duarte
File length: 16265 byte(s)
MAJOR change.
Split DataDistribution into 2 classes: DataDistributer and DataDistribution.
I haven't actually changed or added functionality
DataDistributer deals with the distribution of the data, while DataDistribution deals with things to do once the data is already distributed; right now that is only a few data checks
Note that DataDistributer now has two db fields: srcDb and destDb. This differs from before, when destDb was passed as an argument to the methods
Methods in DataDistributer have been tidied up a little (especially the load and dump ones)
