The dplyr package is quickly becoming the de facto standard for common tasks that involve manipulating data frames. Some of its biggest advantages are:
- A streamlined and unified interface for tasks commonly performed together.
- More syntactically and semantically intuitive functions than the tools in base R.
- Operations that often run much faster than their base R equivalents.
- The ability to manipulate data stored in databases (e.g., MySQL, SQLite, PostgreSQL) with the same language you use for "ordinary" data in R (i.e., data frames).
This is not to say that dplyr replaces base R or makes its syntax obsolete, but it is a great tool to have at your disposal.
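
To make the "unified interface" point concrete, here is a minimal sketch that performs the same filter-then-summarise task twice, once in base R and once with dplyr. The choice of the built-in `mtcars` dataset and the `wt > 3` cutoff are assumptions made purely for illustration, and the example assumes dplyr is already installed.

```r
library(dplyr)

# Base R: subset the rows, then aggregate mean mpg by cylinder count.
heavy <- mtcars[mtcars$wt > 3, ]
base_result <- aggregate(mpg ~ cyl, data = heavy, FUN = mean)

# dplyr: the same steps written as a chain of verbs.
dplyr_result <- mtcars %>%
  filter(wt > 3) %>%        # keep cars with wt above 3
  group_by(cyl) %>%         # one group per cylinder count
  summarise(mpg = mean(mpg))

print(base_result)
print(dplyr_result)
```

Both approaches should yield the same group means; the dplyr version simply reads top to bottom in the order the steps are performed, which is one reason many people find it easier to follow.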