首页 > TAG信息列表 > Definitive
《Spark -- The Definitive Guide》学习笔记Note.2
上一期中,通过翻译与凝练《Spark – The Definitive Guide》,我们初步解决了Spark是什么的问题,今天我们将继续学习Spark的基本架构,应用程序,结构化API,核心术语与概念,分享过程中若有错谬,欢迎拍砖。 Charpter 2.A Gentle Introduction to Spark Spark’s Basic ArchitectureThe Definitive ANTLR 4 Reference
原文:https://github.com/joaoBordalo/feup-COMP/blob/master/The Definitive ANTLR 4 Reference.pdf ANTLR V4介绍 新的解析技术:Adaptive LL() or ALL() 简化对expression的语法表达 自动将左递归规则重写为等价的非左递归规则 parse-tree walker: listener and visitor pattern《Presto(Trino)——The Definitive Guide》CHAPTER 6 Connectors
Presto(暂译:普雷斯托)系列文章目录-CHAPTER 6 Connectors This book provides a great introduction to Presto and teaches you everything you need to know to start your successful usage of Presto. —Dain Sundstrom and David Phillips, Creators of the Presto ProjecElasticsearch: The Definitive Guide Chapter 2 Life inside a Cluster
Add an Index In reality, an index is just a logical namespace that points to one or more pyhsical shards. A shard is a single instance of Lucence, and is a complete search engine in its own right. Coping with FailureKafka:The Definitive Guide learning
Kafka utilizes Zoookeeper for storing metadata information about the brokers, topics, and partititions. Producer Sending a Message to Kafka Sending a Message Asynchronously Configuring Producers acks =1, all Chapter 4 Kafka Consumers: ReadiSpark:The Definitive Book第十四章笔记
In addition to the Resilient Distributed Dataset (RDD) interface, the second kind of low-level API in Spark is two types of “distributed shared variables”: broadcast variables and accumulators. These are variables you can use in your user-defined functionSpark:The Definitive Book第六章笔记
Where to Look for APIs DataFrame本质上是类型为Row的DataSet,需要多看https://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.sql.Dataset来发现API的更新。 DataFrameStatFunctions与DataFrameNaFunctions在解决特定问题上有更多的方法。 DataFrameStatF