Skip to content

Conversation

@cloud-fan
Copy link
Contributor

This patch adds the release notes and relevant announcements for Spark 2.4.0.

The first commit just adds the docs of the latest RC, please review the second commit.

Release Notes: https://github.com/cloud-fan/spark-website/blob/62847a7d4b106ad6ac1aee2016e226b56c7e9d1a/releases/_posts/2018-11-05-spark-release-2-4-0.md

@cloud-fan
Copy link
Contributor Author

Copy link
Member

@felixcheung felixcheung left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it's too many to list or review on github - I reviewed those not under site/docs, LGTM

- **Major features**
- **Barrier Execution Mode**: [[SPARK-24374](https://issues.apache.org/jira/browse/SPARK-24374)] Support Barrier Execution Mode in the scheduler, to better integrate with deep learning frameworks.
- **Scala 2.12 Support**: [[SPARK-14220](https://issues.apache.org/jira/browse/SPARK-14220)] Add experimental Scala 2.12 support. Now you can build Spark with Scala 2.12 and write Spark applications in Scala 2.12.
- **Higher-order functions**: [[SPARK-23899](https://issues.apache.org/jira/browse/SPARK-23899)] Add a lof of new built-in functions, including high-order functions, to deal with complex data types easier.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lof -> lot

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

also higher-order instead of high-order?


- **PySpark**
- [[SPARK-24215](https://issues.apache.org/jira/browse/SPARK-24215)] Implement eager evaluation for DataFrame APIs
- [[SPARK-22274](https://issues.apache.org/jira/browse/SPARK-22274)] User-defined aggregation functions with pandas UDF
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: pandas -> Pandas?

- **PySpark**
- [[SPARK-24215](https://issues.apache.org/jira/browse/SPARK-24215)] Implement eager evaluation for DataFrame APIs
- [[SPARK-22274](https://issues.apache.org/jira/browse/SPARK-22274)] User-defined aggregation functions with pandas UDF
- [[SPARK-22239](https://issues.apache.org/jira/browse/SPARK-22239)] User-defined window functions with pandas UDF
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ditto


### Credits
Last but not least, this release would not have been possible without the following contributors:
Achuth17, Adam Bradbury, Adamyuanyuan, Adelbert Chang, Ala Luszczak, Aleksandr Koriagin, Alessandro Bellina, Alessandro Solimando, Andrew Korzhuev, Anton Okolnychyi, Antonio Murgia, Arseniy Tashoyan, Artem Rudoy, Arun Mahadevan, Asher Saban, Bago Amirbekian, Benjamin Peterson, Bo Meng, Bogdan Raducanu, Bounkong Khamphousone, Brandon Krieger, Brian Lindblom, Bruce Robbins, Bryan Cutler, Cheng Lian, Chongguang LIU, Chris Horn, Chris Martin, Cody Koeninger, DB Tsai, Daniel Sakuma, Daniel Van Der Ende, Darcy Shen, David Vogelbacher, Devaraj K, Dhruve Ashar, Dilip Biswal, Dongjoon Hyun, DylanGuedes, Efim Poberezkin, Eric Chang, Eric Liang, Erik Erlandson, Eyal Farago, Fangshi Li, Felix Cheung, Feng Liu, Fernando Pereira, Florent Pepin, Fokko Driesprong, Gabor Somogyi, Gengliang Wang, Ger Van Rossum, Gera Shegalov, Goun Na, Hao Ren, Henry Robinson, Herman Van Hovell, Hieu Huynh, Holden Karau, Huang Tengfei, Huaxin Gao, Hyukjin Kwon, Ilan Filonenko, Imran Rashid, Jacek Laskowski, Jake Charland, James Thompson, James Yu, Jaroslav Chladek, Jeff Zhang, JiahuiJiang, Jim Kleckner, Joey Krabacher, John Zhuge, Jongyoul Lee, Jooseong Kim, Jose Torres, Joseph Bradley, Joseph K. Bradley, Josh Rosen, Julien Cuquemelle, Juliusz Sompolski, Jungtaek Lim, KaiXinXIaoLei, Kallman, Steven, Karthik Palaniappan, Kaya Kupferschmidt, Kazuaki Ishizaki, Kelley Robinson, Kent Yao, Kevin Yu, KevinZwx, Koert Kuipers, Kousuke Saruta, Kris Mok, LantaoJin, Lee Dongjin, Lemonjing, Li Jin, Liang-Chi Hsieh, Lu WANG, LucaCanali, Marcelo Vanzin, Marco Gaido, Marek Novotny, Mario Molina, Mark Petruska, Maryann Xue, Mathieu St-Louis, Matthew Cheah, Matthew Tovbin, Mauro Palsgraaf, Maxim Gekk, Michael (Stu) Stewart, Michael Allman, Michael Chirico, Michael Mior, Michal Switakowski, Mihaly Toth, Miklos C, Miles Yucht, Misha Dmitriev, Mukul Murthy, Mykhailo Shtelma, Neal Song, Ngone51, Nihar Sheth, Nolan Emirot, Norman Maurer, Onur Satici, Patrick McGloin, Patrick Pisciuneri, Paul Mackles, Peter Toth, Prashant Sharma, Rao Fu, Ray Burgemeestre, Rekha Joshi, Reynold Xin, Reza Safi, Ricardo Martinelli De Oliveira, Rob Vesse, Robert Kruszewski, Rong Tang, Ryan Blue, Sahil Takiar, Saisai Shao, Sandeep Singh, Sandor Murakozi, Sanket Chintapalli, Santiago Saavedra, Sean Owen, Sean Suchter, Sebastian Arzt, Shane Knapp, Shixiong Zhu, Soham Aurangabadkar, Stacy Kerkela, Stan Zhai, Stavros Kontopoulos, Steve Loughran, Sunitha Kambhampati, Takeshi Yamamuro, Takuya UESHIN, Tathagata Das, Ted Yu, Teng Peng, Thiruvasakan Paramasivan, Thomas Graves, Tom Saleeba, Vayda, Oleksandr: IT (PRG), Vinod KC, Vladimir Kuriatkov, Wang Gengliang, Weichen Xu, Wenbo Zhao, Wenchen Fan, William Sheu, XD-DENG, Xiangrui Meng, Xianjin YE, Xianyang Liu, Xiao Li, Xiaogang Tu, Xiayun Sun, Xingbo Jiang, Yacine Mazari, Yash Sharma, Ye Zhou, Yinan Li, Yogesh Garg, Yuanbo Liu, Yuanjian Li, Yuchen Huo, Yuexin Zhang, Yuming Wang, Yuval Itzchakov, Zhan Zhang, Zhang Le, Zheng RuiFeng, Zoltan C. Toth No newline at end of file
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kiszk
Copy link
Member

kiszk commented Nov 5, 2018

Here is a set of output by misspell. I think that Euclidian is intentionally used while I am not familiar with ML.

$ spark-website > ../../bin/misspell site/docs/2.4.0/
site/docs/2.4.0/api/java/org/apache/spark/ExecutorPlugin.html:110:27: "compatability" is a misspelling of "compatibility"
site/docs/2.4.0/api/java/org/apache/spark/ml/stat/Summarizer.html:312:16: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/java/org/apache/spark/ml/stat/Summarizer.html:345:16: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/java/org/apache/spark/mllib/clustering/EMLDAOptimizer.html:130:34: "Teh" is a misspelling of "The"
site/docs/2.4.0/api/java/org/apache/spark/mllib/stat/MultivariateOnlineSummarizer.html:220:23: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/java/org/apache/spark/mllib/stat/MultivariateOnlineSummarizer.html:401:23: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/java/org/apache/spark/storage/ReadableChannelFileRegion.html:190:36: "transfered" is a misspelling of "transferred"
site/docs/2.4.0/api/python/_modules/pyspark/ml/stat.html:375:40: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/scala/lib/scheduler.js:62:41: "existant" is a misspelling of "existent"
site/docs/2.4.0/api/scala/lib/jquery.layout.js:2312:4: "dependancy" is a misspelling of "dependency"
site/docs/2.4.0/api/scala/lib/jquery.layout.js:2629:46: "Destory" is a misspelling of "Destroy"
site/docs/2.4.0/api/scala/lib/jquery.layout.js:4767:4: "dependancies" is a misspelling of "dependencies"
site/docs/2.4.0/api/scala/lib/jquery.layout.js:4768:4: "dependancies" is a misspelling of "dependencies"
site/docs/2.4.0/api/scala/lib/jquery.layout.js:5123:4: "dependancies" is a misspelling of "dependencies"
site/docs/2.4.0/api/scala/lib/jquery.layout.js:5404:4: "dependancies" is a misspelling of "dependencies"
site/docs/2.4.0/api/scala/lib/jquery.layout.js:5434:58: "occassionally" is a misspelling of "occasionally"
site/docs/2.4.0/api/python/searchindex.js:1:138287: "composit" is a misspelling of "compost"
site/docs/2.4.0/api/python/searchindex.js:1:138596: "concret" is a misspelling of "concert"
site/docs/2.4.0/api/python/searchindex.js:1:139122: "convers" is a misspelling of "converts"
site/docs/2.4.0/api/python/searchindex.js:1:141263: "decomposit" is a misspelling of "decompose"
site/docs/2.4.0/api/python/searchindex.js:1:144690: "euclidian" is a misspelling of "euclidean"
site/docs/2.4.0/api/python/searchindex.js:1:152810: "includ" is a misspelling of "include"
site/docs/2.4.0/api/python/searchindex.js:1:163447: "occurr" is a misspelling of "occur"
site/docs/2.4.0/api/python/searchindex.js:1:166110: "portugues" is a misspelling of "portuguese"
site/docs/2.4.0/api/scala/org/apache/spark/ml/stat/Summarizer$.html:722:839: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/scala/org/apache/spark/mllib/clustering/EMLDAOptimizer.html:55:33: "Teh" is a misspelling of "The"
site/docs/2.4.0/api/scala/org/apache/spark/mllib/clustering/package.html:173:33: "Teh" is a misspelling of "The"
site/docs/2.4.0/api/scala/org/apache/spark/mllib/stat/MultivariateOnlineSummarizer.html:533:38: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/scala/org/apache/spark/mllib/stat/MultivariateOnlineSummarizer.html:533:133: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/sql/css/theme.css:12:16937: "adn" is a misspelling of "and"
site/docs/2.4.0/api/sql/js/highlight.pack.js:1:11462: "lenght" is a misspelling of "length"
site/docs/2.4.0/api/sql/js/highlight.pack.js:1:92171: "seperate" is a misspelling of "separate"
site/docs/2.4.0/api/sql/js/highlight.pack.js:2:13040: "recieved" is a misspelling of "received"
site/docs/2.4.0/api/python/pyspark.ml.html:32824:16: "Euclidian" is a misspelling of "Euclidean"
site/docs/2.4.0/api/java/index-all.html:29454:23: "Euclidian" is a misspelling of "Euclidean"
$ spark-website > ../../bin/misspell release-process.md 
release-process.md:198:30: "maintance" is a misspelling of "maintenance"

Otherwise, we could address them if possible (not related to 2.4 release). I am neutral on them.

$ spark-website > ../../bin/misspell README.md _* [a-q]* releases screencasts security.md sitemap.xml sql streaming [t-z]*
news/_posts/2014-09-11-spark-1-1-0-released.md:16:451: "libary" is a misspelling of "library"
news/_posts/2016-03-09-spark-1-6-1-released.md:14:232: "signficant" is a misspelling of "significant"
releases/_posts/2014-08-05-spark-release-1-0-2.md:38:20: "consistant" is a misspelling of "consistent"
releases/_posts/2015-06-11-spark-release-1-4-0.md:28:82: "compatiblity" is a misspelling of "compatibility"
releases/_posts/2013-09-25-spark-release-0-8-0.md:16:159: "pacakge" is a misspelling of "package"
releases/_posts/2015-03-13-spark-release-1-3-0.md:19:603: "Finaly" is a misspelling of "Finally"
releases/_posts/2017-07-11-spark-release-2-2-0.md:68:13: "Availablity" is a misspelling of "Availability"

@cloud-fan
Copy link
Contributor Author

@kiszk thanks for checking them! Can you open a JIRA ticket to fix the typos in the doc? These typos need to be fixed in the spark repo, and they can be available in 2.4.1

For other typos that are not related to 2.4.0, feel free to open a PR to fix them. But I'm not sure if we can still update previous posts.

@kiszk
Copy link
Member

kiszk commented Nov 5, 2018

Since the above problem exists before 2.4, it is ok to fix now or to fix in another PR.

@kiszk
Copy link
Member

kiszk commented Nov 5, 2018

Got it. Since most of file are generated from Spark repo or derived from other libs, it would be good to address in other PRs. We would appreciate it if this PR could address the following long-existing problem.

$ spark-website > ../../bin/misspell release-process.md 
release-process.md:198:30: "maintance" is a misspelling of "maintenance"

@cloud-fan
Copy link
Contributor Author

@kiszk fixed at #157

@srowen
Copy link
Member

srowen commented Nov 5, 2018

@kiszk yeah this should be a pull request against apache/spark to fix typos. That's great! I didn't know about misspell. EDIT: I just opened a PR for the rest at apache/spark#22950

asfgit pushed a commit to apache/spark that referenced this pull request Nov 5, 2018
## What changes were proposed in this pull request?

Fix typos and misspellings, per apache/spark-website#158 (comment)

## How was this patch tested?

Existing tests.

Closes #22950 from srowen/Typos.

Authored-by: Sean Owen <[email protected]>
Signed-off-by: Sean Owen <[email protected]>
(cherry picked from commit c0d1bf0)
Signed-off-by: Sean Owen <[email protected]>
js/downloads.js Outdated
// 2.4.0+
var packagesV9 = [hadoop2p7, hadoop2p6, hadoopFree, scala2p12_hadoopFree, sources];

addRelease("2.4.0", new Date("11/05/2018"), packagesV9, true, true);
Copy link
Member

@dongjoon-hyun dongjoon-hyun Nov 8, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it better to change this since the announcement is delayed? Also, file name also the old date.

news/_posts/2018-11-05-spark-2-4-0-released.md

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I checked 2.3.0, the date is when the vote passed. Let me update.

@gatorsmile
Copy link
Member

LGTM

Thanks! Merged to master.

@asfgit asfgit merged commit 3726c32 into apache:asf-site Nov 8, 2018
jackylee-ch pushed a commit to jackylee-ch/spark that referenced this pull request Feb 18, 2019
## What changes were proposed in this pull request?

Fix typos and misspellings, per apache/spark-website#158 (comment)

## How was this patch tested?

Existing tests.

Closes apache#22950 from srowen/Typos.

Authored-by: Sean Owen <[email protected]>
Signed-off-by: Sean Owen <[email protected]>
kai-chi pushed a commit to kai-chi/spark that referenced this pull request Jul 23, 2019
## What changes were proposed in this pull request?

Fix typos and misspellings, per apache/spark-website#158 (comment)

## How was this patch tested?

Existing tests.

Closes apache#22950 from srowen/Typos.

Authored-by: Sean Owen <[email protected]>
Signed-off-by: Sean Owen <[email protected]>
(cherry picked from commit c0d1bf0)
Signed-off-by: Sean Owen <[email protected]>
kai-chi pushed a commit to kai-chi/spark that referenced this pull request Aug 1, 2019
## What changes were proposed in this pull request?

Fix typos and misspellings, per apache/spark-website#158 (comment)

## How was this patch tested?

Existing tests.

Closes apache#22950 from srowen/Typos.

Authored-by: Sean Owen <[email protected]>
Signed-off-by: Sean Owen <[email protected]>
(cherry picked from commit c0d1bf0)
Signed-off-by: Sean Owen <[email protected]>
zhongjinhan pushed a commit to zhongjinhan/spark-1 that referenced this pull request Sep 3, 2019
## What changes were proposed in this pull request?

Fix typos and misspellings, per apache/spark-website#158 (comment)

## How was this patch tested?

Existing tests.

Closes #22950 from srowen/Typos.

Authored-by: Sean Owen <[email protected]>
Signed-off-by: Sean Owen <[email protected]>
(cherry picked from commit c0d1bf0)
Signed-off-by: Sean Owen <[email protected]>
(cherry picked from commit 8526f2e)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants