基因数据处理28之avocado运行
发布时间:2021-03-07 01:15:01 所属栏目:大数据 来源:网络整理
导读:需要注意的是如果使用avocado的命令行,fs和fq为hdfs路径,properties为本地路径: hadoop @Master :~/xubo/data/testTools/se $ avocado-submit /xubo/avocado/hs1.fq /xubo/avocado/hs38DH.fa /xubo/avocado/test20160527 /home/hadoop/cloud/avocado/basi
需要注意的是如果使用avocado的命令行,fs和fq为hdfs路径,properties为本地路径: hadoop@Master:~/xubo/data/testTools/se$ avocado-submit /xubo/avocado/hs1.fq /xubo/avocado/hs38DH.fa /xubo/avocado/test20160527 /home/hadoop/cloud/avocado/basic.properties Using SPARK_SUBMIT=/home/hadoop/cloud/spark-1.5.2//bin/spark-submit Loading reads in from /xubo/avocado/hs1.fq [Stage 8:> (0 + 2) / 4]SLF4J: Failed to load class "org.slf4j.impl.StaticLoggerBinder". SLF4J: Defaulting to no-operation (NOP) logger implementation SLF4J: See http://www.slf4j.org/codes.html#StaticLoggerBinder for further details. hadoop@Master:~/xubo/data/testTools/se$ hadoop fs -ls /xubo/avocado/test20160527 Found 7 items -rw-r--r-- 3 hadoop supergroup 0 2016-05-27 22:32 /xubo/avocado/test20160527/_SUCCESS -rw-r--r-- 3 hadoop supergroup 13367 2016-05-27 22:32 /xubo/avocado/test20160527/_common_metadata -rw-r--r-- 3 hadoop supergroup 13367 2016-05-27 22:32 /xubo/avocado/test20160527/_metadata -rw-r--r-- 3 hadoop supergroup 13367 2016-05-27 22:31 /xubo/avocado/test20160527/part-r-00000.gz.parquet -rw-r--r-- 3 hadoop supergroup 13367 2016-05-27 22:31 /xubo/avocado/test20160527/part-r-00001.gz.parquet -rw-r--r-- 3 hadoop supergroup 13367 2016-05-27 22:32 /xubo/avocado/test20160527/part-r-00002.gz.parquet -rw-r--r-- 3 hadoop supergroup 13367 2016-05-27 22:31 /xubo/avocado/test20160527/part-r-00003.gz.parquet 详细请见: hadoop@Master:~/xubo/data/testTools/se$ avocado-submit Using SPARK_SUBMIT=/home/hadoop/cloud/spark-1.5.2//bin/spark-submit Argument "READS" is required READS : ADAM read-oriented data REFERENCE : ADAM or FASTA reference genome data VARIANTS : ADAM variant output CONFIG : avocado configuration file -debug : If set,prints a higher level of debug output. -fragment_length N : Sets maximum fragment length. Default value is 10,000. Values greater than 1e9 should be avoided. -h (-help,--help,-?) : Print help -parquet_block_size N : Parquet block size (default = 128mb) -parquet_compression_codec [UNCOMPRESSED | SNAPPY | GZIP | LZO] : Parquet compression codec -parquet_disable_dictionary : Disable dictionary encoding -parquet_logging_level VAL : Parquet logging level (default = severe) -parquet_page_size N : Parquet page size (default = 1mb) -print_metrics : Print metrics to the log on completion 参考: (编辑:威海站长网) 【声明】本站内容均来自网络,其相关言论仅代表作者个人观点,不代表本站立场。若无意侵犯到您的权利,请及时与联系站长删除相关内容! |