Skip to content

Instantly share code, notes, and snippets.

@yutannihilation
Created April 13, 2017 12:37
Show Gist options
  • Select an option

  • Save yutannihilation/9df8053875f88d9954aab3a9cc22900c to your computer and use it in GitHub Desktop.

Select an option

Save yutannihilation/9df8053875f88d9954aab3a9cc22900c to your computer and use it in GitHub Desktop.
--- introduction.Rmd 2017-04-13 21:34:20.746708500 +0900
+++ dplyr.Rmd 2017-04-13 21:32:54.691292800 +0900
@@ -1,6 +1,5 @@
---
title: "Introduction to dplyr"
-date: "`r Sys.Date()`"
output: rmarkdown::html_vignette
vignette: >
%\VignetteIndexEntry{Introduction to dplyr}
@@ -13,6 +12,7 @@
options(tibble.print_min = 4L, tibble.print_max = 4L)
library(dplyr)
library(ggplot2)
+set.seed(1014)
```
When working with data you must:
@@ -233,7 +233,7 @@
# Grouped operations
-These verbs are useful on their own, but they become really powerful when you apply them to groups of observations within a dataset. In dplyr, you do this by with the `group_by()` function. It breaks down a dataset into specified groups of rows. When you then apply the verbs above on the resulting object they'll be
automatically applied "by group". Most importantly, all this is achieved by using the same exact syntax you'd use with an ungrouped object.
+These verbs are useful on their own, but they become really powerful when you apply them to groups of observations within a dataset. In dplyr, you do this with the `group_by()` function. It breaks down a dataset into specified groups of rows. When you then apply the verbs above on the resulting object they'll be au
tomatically applied "by group". Most importantly, all this is achieved by using the same exact syntax you'd use with an ungrouped object.
Grouping affects the verbs as follows:
@@ -368,13 +368,7 @@
## Databases
-dplyr also allows you to use the same verbs with a remote database. It takes care of generating the SQL for you so that you can avoid the cognitive challenge of constantly switching between languages. See the databases vignette for more details.
-
-Compared to DBI and the database connection algorithms:
-
-* it hides, as much as possible, the fact that you're working with a remote database
-* you don't need to know any SQL (although it helps!)
-* it abstracts over the many differences between the different DBI implementations
+dplyr also allows you to use the same verbs with a remote database. It takes care of generating the SQL for you so that you can avoid the cognitive challenge of constantly switching between languages. To use these capabilities, you'll need to install the dbplyr package and then read `vignette("dbplyr")` for the det
ails.
## Multidimensional arrays / cubes
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment