merge                  package:base                  R Documentation

_M_e_r_g_e _T_w_o _D_a_t_a _F_r_a_m_e_s

_D_e_s_c_r_i_p_t_i_o_n:

     Merge two data frames by common columns or row names.

_U_s_a_g_e:

     merge(x, y, by, by.x, by.y, sort = TRUE)

_A_r_g_u_m_e_n_t_s:

    x, y: data frames, or objects to be coerced to one

by, by.x, by.y: specifcations of the common columns. See Details.

    sort: logical. Should the results be sorted on the `by' columns?

_D_e_t_a_i_l_s:

     By default the data frames are merged on the columns with names
     they both have, but separate specifcations of the columns can be
     given by `by.x' and `by.y'. Columns can be specified by name,
     number or by a logical vector: the name `"row.names"' or the
     number `0' specifies the row names.  The rows in the two data
     frames that match on the specified columns are extracted, and
     joined together. If there is more than one match, all possible
     matches contribute one row each.

     If the remaining columns in the data frames have any common names,
     these have `".x"' and `".y"' appended to make the names of the
     result unique.

_V_a_l_u_e:

     A data frame. The rows are by default lexicographically sorted on
     the common columns, but are otherwise in the order in which they
     occurred in `x'. The columns are the common columns followed by
     the remaining columns in `x' and then those in `y'.  If the
     matching involved row names, an extra column `Row.names' is added
     at the left, and in all cases the result has no special row names.

_S_e_e _A_l_s_o:

     `data.frame', `by', `cbind'

_E_x_a_m_p_l_e_s:

     authors <- data.frame(
         surname = c("Tukey", "Venables", "Tierney", "Ripley", "McNeil"),
         nationality = c("US", "Australia", "US", "UK", "Australia"),
         retired = c("yes", rep("no", 4)))
     books <- data.frame(
         name = c("Tukey", "Venables", "Tierney", "Ripley", "Ripley", "McNeil"),
         title = c("Exploratory Data Analysis",
                   "Modern Applied Statistics ...",
                   "LISP-STAT",
                   "Spatial Statistics", "Stochastic Simulation",
                    "Interactive Data Analysis"),
         other.author = c(NA, "Ripley", NA, NA, NA, NA))

     merge(authors, books, by.x="surname", by.y="name")
     merge(books, authors, by.x="name", by.y="surname")

