How to use pandas groupby multiple columns?

404 Asked by DavidEDWARDS in Data Science , Asked on Feb 14, 2023

I have a data frame which contains duplicates I'd like to combine based on 1 column (name). In half of the other columns I'd like to keep one value (as they should all be the same) whereas I'd like to sum the others.

I've tried the following code based on an answer I found here: Pandas merge column duplicate and sum value

df2 = df.groupby(['name']).agg({'address': 'first', 'cost': 'sum'}
The only issue is I have 100 columns, so would rather not list them all out. Is there a way to pass a tuple or list in the place of 'address' and 'cost' above? Something along the lines of
column_list = df.columns.values.tolist()
columns_first = tuple(column_list[0:68])
columns_sum = tuple(column_list[68:104])

Answered by David EDWARDS

To use pandas groupby multiple columns, you could perhaps generate the dictionary using a list comprehension style syntax. E.g.

df2 = df.groupby(['name']).agg({col: 'first' if i


            
               Your Answer
            
                           
                  
                  
                                          
                                                                                                     
                     
                        
                        
                     
                                                                                       
                           
                           
                           Email me when someone reply to thread


         
         
         
         
         

	Categories
	
		
			
									
						 Salesforce (1353) 													
																	
											Salesforce Lightning (25)
																			
																	
											Development (82)
																			
															
											
									
													Business Analyst (260)
																	
									
						 QA Testing (438) 													
																	
											Manual Testing (45)
																			
																	
											Automation Testing (71)
																			
																	
											Selenium (44)
																			
															
											
									
													AWS (427)
																	
									
													SQL Server (1374)
																	
									
						 Data Science (766) 													
																	
											Machine Learning (122)
																			
																	
											Natural Language Processing (117)
																			
																	
											Deep Learning (2)
																			
																	
											R (123)
																			
															
											
									
						 Devops (520) 													
																	
											Ansible (4)
																			
																	
											Docker (20)
																			
																	
											Nagios (27)
																			
																	
											Git (27)
																			
																	
											Maven (4)
																			
																	
											Linux (26)
																			
																	
											kubernetes (16)
																			
															
											
									
													Tableau (218)
																	
									
													Big Data Hadoop (35)
																	
									
						 Python (716) 													
																	
											Angular (36)
																			
																	
											HTML (9)
																			
																	
											Module (24)
																			
															
											
									
													Java (627)
																	
									
													Business Intelligence (8)
																	
									
													Cyber Security (836)
																	
									
													Power BI (22)
																	
									
													Spark (12)
																	
									
													Web-development (63)
																	
									
													Artificial intelligence (75)
																	
									
													Android App Development (7)
																	
									
													azure (12)
																	
									
													Digital Marketing (12)
																	
							
		
	
	
		
			Download Free eBooks
		
				
		
	
	
		
			
				Demo Classes Available			
			
		
	
	
		
			
			JanBask
eSchool