Pandas COUNTIF based on column value

Question

Pandas COUNTIF based on column value

I am trying to essentially do COUNTIF in pandas to count how many items in a row match the number in the first column.

Dataframe:

So, I want to count the instances in line (b, c, d) that match a. Line 1, for example, should be 1, since only d matches a.

I searched a bit for this, but so far only found examples where its total (e.g. counting all values greater than 0) but not based on the dataframe column. I am guessing its some form of logic that masks based on the column but df == df.a

doesn't seem to work

+3

python pandas dataframe

kmccarty 04 Apr 17 at 20:24

source to share

2 answers

df.apply(lambda x: (x == x[0]).sum()-1,axis=1)

+3

Scott boston 04 Apr 17 at 20:40

source to share

Psidom · Accepted Answer · 2017-04-04T20:45:02+0000

You can use eq

which you can pass to a parameter axis

to indicate the direction of the comparison, then you can do the sum of the string to count the number of values matched:

df.eq(df.a, axis=0).sum(1) - 1

#0    1
#1    1
#2    1
#dtype: int64

Pandas COUNTIF based on column value

More articles: