Waste Water Treatment

The wwt submodule contains data from approximately 20,000 experiments focused on the removal of various contaminants from wastewater using treatment strategies such as adsorption, photocatalysis, membrane filtration, and sonolysis. This submodule provides a unified interface to access all this data, which is scattered across the literature, in a standardized format using a few Python functions. It is important to note that we do not introduce this data since this data has already been utilized and analyzed in various peer-reviewed scientific publications. However, we offer a simple and easy-to-use interface to access this existing data. The availability of such a large corpus of experimental data can significantly aid in data-driven modeling and material discovery. A summary of these datasets is provided in following table.

List of datasets

Summary of datasets
Treatment Process	Function Name	Parameters	Target Pollutant	Data Points	Reference
`Adsorption`	`aqua_fetch.ec_removal_biochar()`	26	Emerg. Contaminants	3,757	Jaffari et al., 2023
`Adsorption`	`aqua_fetch.cr_removal()`	15	Cr	219	Ishtiaq et al., 2024
`Adsorption`	`aqua_fetch.heavy_metal_removal()`	30	heavy metals	1518	Jaffari et al., 2023
`Adsorption`	`aqua_fetch.po4_removal_biochar()`	30	po4	5014	Iftikhar et al., 2024
`Adsorption`	`aqua_fetch.industrial_dye_removal()`	12	Industrial Dye	1514	Iftikhar et al., 2023
`Adsorption`	`aqua_fetch.heavy_metal_removal_Shen()`	17	Heavy Metals	689	Shen et al., 2023
`Adsorption`	`aqua_fetch.P_recovery()`	8	P	504	Leng et al., 2024
`Adsorption`	`aqua_fetch.N_recovery()`	8	N	211	Leng et al., 2024
`Adsorption`	`aqua_fetch.As_recovery()`	13	As	1605	Huang et al., 2024
`Photocatalysis`	`aqua_fetch.mg_degradation()`	11	Melachite Green	1200	Jaffari et a., 2023
`Photocatalysis`	`aqua_fetch.dye_removal()`	23	Dyes	1527	Kim et al., 2024
`Photocatalysis`	`aqua_fetch.dichlorophenoxyacetic_acid_removal()`	15	2,4,Dichlorophenoxyacetic acid	1044	Kim et al., 2024
`Photocatalysis`	`aqua_fetch.pms_removal()`			2078	submitted et al., 2024
`Photocatalysis`	`aqua_fetch.tetracycline_degradation()`	8	Tetracycline	374	Abdi et al., 2022
`Photocatalysis`	`aqua_fetch.tio2_degradation()`	7	TiO2	446	Jiang et al., 2020
`Photocatalysis`	`aqua_fetch.photodegradation_Jiang()`	8	multiple	457	Jiang et al., 2021
`Membrane`	`aqua_fetch.micropollutant_removal_osmosis()`	18	micropollutants	1906	Jeong et al., 2021
`sonolysis`	`aqua_fetch.cyanobacteria_disinfection()`	6	Cyanobacteria	314	Jaffari et al., 2024

Adsorption

aqua_fetch.ec_removal_biochar(parameters: str | List[str] = 'all', encoding: str = None) → Tuple[DataFrame, Dict[str, OneHotEncoder | LabelEncoder | Any]][source]

Data of removal of emerging contaminants/pollutants from wastewater using biochar. The data consists of three types of features, 1) adsorption experimental conditions, 2) elemental composition of adsorbent (biochar) and 3) parameters representing physical and synthesis conditions of biochar. For more description of this data see Jaffari et al., 2023

Parameters:

parameters –
By default following features are used as input
- adsorbent
- pyrolysis_temperature
- pyrolysis_time
- C
- H
- O
- N
- (O+N)/C
- ash
- H/C
- O/C
- N/C
- surface_area
- pore_volume
- average_pore_size
- pollutant
- adsorption_time
- concentration
- Solution_ph
- rpm
- volume
- adsorbent_dosage
- adsorption_temperature
- ion_concentration
- humid_acid
- wastewater_type
- adsorption_type
- final_concentration
- capacity
encoding (str, default=None) – the type of encoding to use for categorical features. If not None, it should be either ohe or le.

Returns:

A tuple of length two. The first element is a DataFrame while the second element is a dictionary consisting of encoders with adsorbent pollutant, wastewater_type and adsorption_type as keys.

Return type:

Waste Water Treatment

List of datasets

Adsorption

Photocatalysis

Membrane

Sonolysis