The Resource Learning Apache Drill : query and analyze distributed data sources with SQL, Charles Givre and Paul Rogers

Learning Apache Drill : query and analyze distributed data sources with SQL, Charles Givre and Paul Rogers

Label
Learning Apache Drill : query and analyze distributed data sources with SQL
Title
Learning Apache Drill
Title remainder
query and analyze distributed data sources with SQL
Statement of responsibility
Charles Givre and Paul Rogers
Title variation
Query and analyze distributed data sources with SQL
Creator
Author
Subject
Language
eng
Summary
Get up to speed with Apache Drill, an extensible distributed SQL query engine that reads massive datasets in many popular file formats such as Parquet, JSON, and CSV. Drill reads data in HDFS or in cloud-native storage such as S3 and works with Hive metastores along with distributed databases such as HBase, MongoDB, and relational databases. Drill works everywhere: on your laptop or in your largest cluster. In this practical book, Drill committers Charles Givre and Paul Rogers show analysts and data scientists how to query and analyze raw data using this powerful tool. Data scientists today spend about 80% of their time just gathering and cleaning data. With this book, you'll learn how Drill helps you analyze data more effectively to drive down time to insight. Use Drill to clean, prepare, and summarize delimited data for further analysis ; Query file types including logfiles, Parquet, JSON, and other complex formats ; Query Hadoop, relational databases, MongoDB, and Kafka with standard SQL ; Connect to Drill programmatically using a variety of languages ; Use Drill even with challenging or ambiguous file formats ; Perform sophisticated analysis by extending Drill's functionality with user-defined functions ; Facilitate data analysis for network security, image metadata, and machine learning
Cataloging source
YDX
http://library.link/vocab/creatorName
Givre, Charles
Dewey number
005.741
Illustrations
illustrations
Index
index present
LC call number
QA76.9.F5
LC item number
G587 2019
Literary form
non fiction
http://library.link/vocab/subjectName
  • File organization (Computer science)
  • Querying (Computer science)
  • SQL (Computer program language)
  • Big data
  • Big data
  • File organization (Computer science)
  • Querying (Computer science)
  • SQL (Computer program language)
Label
Learning Apache Drill : query and analyze distributed data sources with SQL, Charles Givre and Paul Rogers
Instantiates
Publication
Copyright
Note
Includes index
Carrier category
volume
Carrier category code
  • nc
Carrier MARC source
rdacarrier
Content category
text
Content type code
  • txt
Content type MARC source
rdacontent
Contents
Introduction to Apache Drill -- Installing and running Drill -- Overview of Apache Drill -- Querying delimited data -- Analyzing complex and nested data -- Connecting drill to data sources -- Connecting to drill -- Data engineering with Drill -- Deploying Drill in production -- Setting up your development environment -- Writing Drill user-defined functions -- Writing a format plug-in -- Unique uses of Drill -- List of Drill functions -- Drill formatting strings
Dimensions
24 cm
Extent
xvi, 311 pages
Isbn
9781492032793
Media category
unmediated
Media MARC source
rdamedia
Media type code
  • n
Other physical details
illustrations
System control number
  • (OCoLC)1017901276
  • (YBP)ybp15071611
Label
Learning Apache Drill : query and analyze distributed data sources with SQL, Charles Givre and Paul Rogers
Publication
Copyright
Note
Includes index
Carrier category
volume
Carrier category code
  • nc
Carrier MARC source
rdacarrier
Content category
text
Content type code
  • txt
Content type MARC source
rdacontent
Contents
Introduction to Apache Drill -- Installing and running Drill -- Overview of Apache Drill -- Querying delimited data -- Analyzing complex and nested data -- Connecting drill to data sources -- Connecting to drill -- Data engineering with Drill -- Deploying Drill in production -- Setting up your development environment -- Writing Drill user-defined functions -- Writing a format plug-in -- Unique uses of Drill -- List of Drill functions -- Drill formatting strings
Dimensions
24 cm
Extent
xvi, 311 pages
Isbn
9781492032793
Media category
unmediated
Media MARC source
rdamedia
Media type code
  • n
Other physical details
illustrations
System control number
  • (OCoLC)1017901276
  • (YBP)ybp15071611

Library Locations

  • Architecture LibraryBorrow it
    Gould Hall 830 Van Vleet Oval Rm. 105, Norman, OK, 73019, US
    35.205706 -97.445050
  • Bizzell Memorial LibraryBorrow it
    401 W. Brooks St., Norman, OK, 73019, US
    35.207487 -97.447906
  • Boorstin CollectionBorrow it
    401 W. Brooks St., Norman, OK, 73019, US
    35.207487 -97.447906
  • Chinese Literature Translation ArchiveBorrow it
    401 W. Brooks St., RM 414, Norman, OK, 73019, US
    35.207487 -97.447906
  • Engineering LibraryBorrow it
    Felgar Hall 865 Asp Avenue, Rm. 222, Norman, OK, 73019, US
    35.205706 -97.445050
  • Fine Arts LibraryBorrow it
    Catlett Music Center 500 West Boyd Street, Rm. 20, Norman, OK, 73019, US
    35.210371 -97.448244
  • Harry W. Bass Business History CollectionBorrow it
    401 W. Brooks St., Rm. 521NW, Norman, OK, 73019, US
    35.207487 -97.447906
  • History of Science CollectionsBorrow it
    401 W. Brooks St., Rm. 521NW, Norman, OK, 73019, US
    35.207487 -97.447906
  • John and Mary Nichols Rare Books and Special CollectionsBorrow it
    401 W. Brooks St., Rm. 509NW, Norman, OK, 73019, US
    35.207487 -97.447906
  • Library Service CenterBorrow it
    2601 Technology Place, Norman, OK, 73019, US
    35.185561 -97.398361
  • Price College Digital LibraryBorrow it
    Adams Hall 102 307 West Brooks St., Norman, OK, 73019, US
    35.210371 -97.448244
  • Western History CollectionsBorrow it
    Monnet Hall 630 Parrington Oval, Rm. 300, Norman, OK, 73019, US
    35.209584 -97.445414
Processing Feedback ...