findSingleCellDuplicates - findSingleCellDuplicates
Description¶
Finds sequences that share the same identifier and sequence between groups.
Usage¶
findSingleCellDuplicates(
db,
fields,
cell_id = "cell_id",
seq = "sequence_alignment",
sequence_id = "sequence_id"
)
Arguments¶
- db
- data.frame with AIRR-format style columns.
- fields
- Columns in
db, in addition tosample_id, that should be used to group sequences to be analyzed independently. - cell_id
- column in
dbcontaining cell identifiers - seq
- column in
dbcontaining sequences to be compared - sequence_id
- column in
dbcontaining sequence identifiers
Value¶
A list with fields:
dups: a data.frame with the columnsc_duplicatewith values TRUE/FALSE to indicate whether the the row corresponds to a duplicated entry.fields: a data.frame showing the input fields usedcell_id: column indbcontaining cell identifiersseq: column indbcontaining sequence datasequence_idcolumn indbcontainin sequence identifiers