Fix Python – Search for “does-not-contain” on a DataFrame in pandas

Question

Asked By – stites

I’ve done some searching and know how to filter a dataframe by

df["col"].str.contains(word)

however I’m wondering if there is a way to do the reverse: filter a dataframe by that set’s complement, e.g. to the effect of

!(df["col"].str.contains(word))

Can this be done through a DataFrame method?


Answer

You can use the invert (~) operator (which acts like a not for boolean data):

new_df = df[~df["col"].str.contains(word)]

where new_df is the copy returned by the right-hand side.
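A minimal sketch of the inversion, assuming a small made-up DataFrame (the column values and the word "ball" are illustrative only and do not come from the question):

```python
import pandas as pd

# Hypothetical sample data, for illustration only.
df = pd.DataFrame({"col": ["football", "basketball", "tennis", "golf"]})
word = "ball"

# Boolean mask: True where the string contains the word.
mask = df["col"].str.contains(word)

# ~ flips every boolean, so this keeps the rows that do NOT contain the word.
new_df = df[~mask]

print(new_df["col"].tolist())  # ['tennis', 'golf']
```

Building the mask on its own line first makes it easy to inspect before inverting it.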

contains also accepts a regular expression…
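As a rough illustration of the regular-expression behaviour, the sketch below uses invented data and patterns. By default the pattern is interpreted as a regex; passing regex=False makes str.contains treat it as a literal substring instead.

```python
import pandas as pd

# Hypothetical sample data, for illustration only.
df = pd.DataFrame({"col": ["apple pie", "banana split", "cherry tart", "a.b"]})

# Pattern is treated as a regex by default: drop rows ending in "pie" or "tart".
no_pie_or_tart = df[~df["col"].str.contains(r"pie$|tart$")]
print(no_pie_or_tart["col"].tolist())  # ['banana split', 'a.b']

# With regex=False the "." is a literal dot, not "any character".
no_dot = df[~df["col"].str.contains(".", regex=False)]
print(no_dot["col"].tolist())  # ['apple pie', 'banana split', 'cherry tart']
```

If the search word may contain regex metacharacters such as "." or "(", regex=False (or escaping the word with re.escape) avoids surprising matches.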


If the above throws a ValueError, the likely cause is mixed datatypes in the column (for example, missing values such as NaN alongside strings), so use na=False:

new_df = df[~df["col"].str.contains(word, na=False)]
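The effect of na=False can be sketched as follows, assuming a hypothetical column with one missing value. Note that with na=False a missing entry counts as "does not contain", so that row is kept in the inverted result; passing na=True instead would drop it.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data with a missing value, for illustration only.
df = pd.DataFrame({"col": ["football", np.nan, "tennis"]})
word = "ball"

# na=False fills the missing entry with False, giving a clean boolean mask.
mask = df["col"].str.contains(word, na=False)

# Inverting keeps every row that is not a match, including the missing one.
new_df = df[~mask]

print(new_df.index.tolist())  # [1, 2]
```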

Alternatively, compare the mask to False:

new_df = df[df["col"].str.contains(word) == False]

Answered By – Andy Hayden

This answer is collected from Stack Overflow and reviewed by FixPython community admins; it is licensed under CC BY-SA 2.5, CC BY-SA 3.0 and CC BY-SA 4.0.