accumulo-user mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Al Krinker <>
Subject Issue with importDirectory
Date Thu, 01 May 2014 20:46:06 GMT
So I am trying to create my own rFile and write it to accumulo...

in the nutshell.

I create my rFile and two directories. One that would contain the file and
one for failures, both required by importDirectory

        Configuration conf = new Configuration();
        conf.set("", "hdfs://blah:9000/");
        FileSystem fs = FileSystem.get(conf);

        Path input = new Path("/accumulo/temp1/testing/");
        Path output = new Path("/accumulo/temp1/testing/my_output");

        String extension = conf.get(FILE_TYPE);
        if (extension == null || extension.isEmpty()) {
            extension = RFile.EXTENSION;
        String filename = "/accumulo/temp1/testing/my_input/testFile." +
        Path file = new Path(filename);
        if (fs.exists(file)) {
            file.getFileSystem(conf).delete(file, false);
        FileSKVWriter out =
RFileOperations.getInstance().openWriter(filename, fs, conf,
        long timestamp = (new Date()).getTime();
        Key key = new Key(new Text("row_1"), new Text("cf"), new Text("cq"),
                new ColumnVisibility(), timestamp);
        Value value = new Value("".getBytes());
        out.append(key, value);

at this point i can ssh into my namenode and see the file and two

then i try to bulk import it

        String instanceName = "blah";
        String zooServers = "blah:2181,blah:2181"
        String userName = ; // Provide username
        String password = ; // Provide password
        // Connect
        Instance inst = new ZooKeeperInstance(instanceName, zooServers);
        Connector conn = inst.getConnector(userName, password);
        TableOperations ops = conn.tableOperations();
        ops.importDirectory("mynewtesttable", input.toString(),
output.toString(), false);

The exception that I am getting is
SEVERE: null
org.apache.accumulo.core.client.AccumuloException: Bulk import directory
/accumulo/temp1/testing does not exist!

I tried to play around with the file/directory owner by manually setting it
to accumulo and then hadoop, but no luck.

I checked hdfs-site and I have

Any ideas?

Any guesses of what might be wrong?

View raw message