Datasets
Permanent URI for this collectionhttps://dspace.nm-aist.ac.tz/handle/123456789/3197
Browse
Recent Submissions
Item Dataset: Assessment of heavy metals in soil and water from Bahi district, Tanzania(PLOS One, 2025-06-11) Sumary, Dominic; Raymond, Jofrey; Chacha, Musa; Banzi, Firmi; Gomezulu, EdwinCorrelation coefficient (r) matrix for heavy metal and with metal oxides concentrations in soil samples.Item Dataset: Heterogeneous impacts for malaria control from larviciding across villages and considerations for monitoring and evaluation(PLOS Pathogens, 2025-07-28) Sherrard-Smith, Ellie; Fillinger, Ulrike; Tia, Jean-Philippe; Winskill, Peter; Koudou, Benjamin; Tchicaya, Emile; Sanou, Antoine; Okumu, Fredros; Opiyo, Mercy; Majambere, Silas; Hamlet, Arran; Charles, Giovanni; Lambert, Ben; Churcher, ThomasThe empirical estimates and derived parameters used to calibrate the transmission model for villages testing nets or nets and larviciding in the Kenyan trial [34]. Crude estimates for the total reduction in adult mosquitoes at the village level, the global estimates of species reductions according to the GLMM analysis (Eqs 4–7). The relative species compositions in each village that are used to weight the percentage reductions in adult densities simulated in the transmission model, the village-level statistically estimated ranges in reductions in vectors, the baseline prevalences as measured using microscopy in the baItem Dataset: Population status of leopard in one of Africa’s largest wilderness areas and the challenge of monitoring at scale(Dryad, 2025-11-17) Searle, Charlotte; Strampelli, Paolo; Parsais, Singira; Haule, Leonard; Olesyapa, Kandey; Salum, Nasri; Mtoka, Samuel; Hape, Germanus; Mathayo, Daniel; Elisa, Manase; Lobora, Alex; Dickman, AmyRemarkably little is still understood about how the leopard (Panthera pardus) is faring in much of its remaining African range, despite the species’ importance for ecosystem function and generating funding for conservation via tourism. In this study, we address this knowledge gap in southern Tanzania’s Selous-Nyerere ecosystem, one of the largest intact wilderness areas on the continent, by estimating leopard population density via spatially explicit capture-recapture (SECR) modelling of data from seven camera trap surveys. Population density was highest in Nyerere National Park’s Matambwe sector (8.08 ± SE 1.54 adult and subadult leopards per 100 km2), followed by Selous Game Reserve’s Miguruwe sector (7.38 ± 1.26 per 100 km2); Nyerere NP’s Msolwa sector (6.05 ± 0.78 per 100 km2); Selous GR’s Liwale sector (5.93 ± 0.88 per 100 km2), western Kingupira sector (5.58 ± 0.87 per 100 km2), and eastern Kingupira sector (5.22 ± 0.71 per 100 km2); and Nyerere NP’s Kalulu sector (3.80 ± 0.64 per 100 km2). Together, our surveys covered an important component of extant leopard range in Tanzania, and our findings highlight the importance of the Selous-Nyerere ecosystem as a leopard stronghold. The estimates include the highest leopard densities yet documented in miombo woodland, which represents nearly one fifth of the species’ remaining African range. Unlike lion, leopard population density was highly correlated with relative abundance of preferred prey. Although limited by a small number of data points, this suggests that the two species may not be uniformly affected by anthropogenic threats. Threats to leopard in Selous-Nyerere include accelerating habitat conversion in boundary areas and bushmeat poaching, which impacts leopard indirectly by suppressing prey populations and directly via accidental snaring. Practical implication: Placed in the context of range-wide leopard monitoring, this study highlights the need to address persistent knowledge gaps on the species’ continental status and prioritise sites for monitoring based on their potential to inform evidence-based conservation management.Item Dataset: Replication Data & Code - Large-scale land acquisitions exacerbate local land inequalities in Tanzania(Proceedings of the National Academy of Sciences, 2023-07-31) Sullivan, Jonathan; Samii, Cyrus; Brown, Daniel; Moyo, Francis; Agrawal, ArunLand inequality stalls economic development, entrenches poverty, and is associated with environmental degradation. Yet, rigorous assessments of land-use interventions attend to inequality only rarely. A land inequality lens is especially important to understand how recent large-scale land acquisitions (LSLAs) affect smallholder and indigenous communities across as much as 100 million hectares around the world. This paper studies inequalities in land assets, specifically landholdings and farm size, to derive insights into the distributional outcomes of LSLAs. Using a household survey covering four pairs of land acquisition and control sites in Tanzania, we use a quasi-experimental design to characterize changes in land inequality and subsequent impacts on well-being. We find convincing evidence that LSLAs in Tanzania lead to both reduced landholdings and greater farmland inequality among smallholders. Households in proximity to LSLAs are associated with 21.1% (P = 0.02) smaller landholdings while evidence, although insignificant, is suggestive that farm sizes are also declining. Aggregate estimates, however, hide that households in the bottom quartiles of farm size suffer the brunt of landlessness and land loss induced by LSLAs that combine to generate greater farmland inequality. Additional analyses find that land inequality is not offset by improvements in other livelihood dimensions, rather farm size decreases among households near LSLAs are associated with no income improvements, lower wealth, increased poverty, and higher food insecurity. The results demonstrate that without explicit consideration of distributional outcomes, land-use policies can systematically reinforce existing inequalities.Item Dataset: Using dung densities to assess the ecological effectiveness of a protected area network(Zenodo, 2024-03-05) Giliba, Richard; Kiffner, Christian; Fust, Pascal; Loos, JacquelineGiven recent global endeavors to increase protected area coverage, it is crucial to comprehensively evaluate the efficacy of various area-based conservation strategies in effectively reducing biodiversity loss. Here, we investigated responses of wildlife populations to different protection levels and environmental variables at the landscape scale in the Katavi-Rukwa Ecosystem, western Tanzania. To this end, we conducted line distance sampling surveys and counted dung of six target mammal species (elephant, giraffe, buffalo, zebra, topi, hartebeest) along foot transects within areas differing in protection levels (from strict to less-strictly protected: national park, game reserve, forest reserve, game-controlled area, and unprotected areas). Based on these dung counts, we modelled the spatial distribution of these six mammal species using a species-specific density surface modelling framework. We, found consistent effects of protection level and land-use variables on the spatial distribution of the target mammal species: dung densities were highest in the national park and game reserves, intermediate in less-strictly protected areas and lowest in un-protected areas. Beyond species-specific environmental predictors for dung densities, our results highlight consistent negative associations between dung densities of the target species and distance to cropland and avoidance of areas in proximity to houses. Our findings underpin differences in ecological effectiveness of protected areas within one ecosystem. Protection level and land use play crucial roles in moderating the spatial distribution of all considered mammal species. Our findings suggest that a landscape approach needs to guide effective conservation across the entire protection gradient of the Katavi-Rukwa Ecosystem.Item Dataset: Multiple paternity and number of offspring in mammals(Zenodo, 2018-10-23) Dobson, F. Stephen; Abebe, Ash; Correia, Hannah; Kasumo, Christian; Zinner, Bertram; Correia, HannahMany cooperative social attributes are being linked to characteristics of mating systems, particularly to the rate of multiple paternity that typifies a population. Under the logic that greater offspring production by females should engender greater competition among males to mate with females, it is predicted that multiple paternity should increase with litter sizes. We tested the predicted positive association of multiple paternity and litter size with a meta-analysis of 59 species of mammals. The probability of multiple paternity and mean litter size were positively correlated, but not significantly (Zr = 0.202). Also, the mean number of sires of litters increased with mean litter size, but not significantly (Zr = 0.235). We developed a combinatorial formula for the influence of number of male mates and litter size on the probability of multiple paternity. We used Bayesian Markov chain Monte Carlo simulations to generate an expectation for the form of the relationship between the probability of multiple paternity and mean litter size. Under the assumption of random samplings of numbers of mates, the expected association of the probability of multiple paternity and mean litter sizes among species was positive, curvilinear, and relatively high. However, the empirical probabilities of multiple paternities were much less than expected, suggesting that behavioral factors (such as mating-associated behaviors) or ecological characteristics (such as population density) likely limit the number of male mates for reproductive females. The probability of multiple paternity in a population is an estimate of mating patterns that does not closely reflect the number of sires of individual litters. We suggest use of the estimated probability of mating success for males as an alternative measure of their contribution to the mating system.Item Dataset: SNP genotyping of indigenous goats of Uganda based on the Goat_IGGC_65K_v2 illumina chip(Zenodo, 2024) Nantongo, Ziwena; Birungi, Josephine; Obol Opiyo, Stephen; Shirima, Gabriel; Mugerwa, Swidiq; Mutai, Collins; Kyalo, Martina; Munishi, Linus; Agaba, Morris; Mrode, RaphaelUganda's indigenous goats are characterised based on ethnic communities that raise them, average mature weight, and hair coat characteristics. Uganda's indigenous goats have been genotyped based on the Goat_IGGC_65K_v2 illumina chip to study their population structure and genetic characteristics. Information generated from this data is vital for the sustainable utilisation, development, and conservation of Uganda's goat genetic resources. Methods The data was collected from genomic DNA extracted from ear tissue samples of 1032 indigenous goats from different agro-ecological zones of Uganda. The study aimed to characterise the genetic diversity, population structure and signatures of selection of indigenous goats in different agroecological zones of Uganda. Indigenous goats were identified in their known phenotypic/ ethnic classifications of Mubende, Kigezi and Small East African goats in all the 10 agroecologies. Ear tissue samples were collected from each goat using the allflex tissue sampling system that allows for no cross contamination as each sample self locks in a single use tube after it is collected. Genomic DNA was extracted from each sample using the TANBEAD automated DNA extraction system with the 6T2 total tissue DNA extraction kit. DNA samples were genotyped based on the Goat_IGGC_65K_v2 illumina chip. Data processing After genotyping, 59725 SNPs from 1032 genotypes were received in PLINK format. The data was opened in Tassel Version 5.2 and saved as VCF format for further processing. The near complete goat genome (ARS1) was used as reference to align all the SNPs to the right chromosomes and chromosome positions using R (version 4.3.3) software. The final data set was saved as a VCF file for analysis.Item Dataset: Land Degradation and Soil Erosion Control Measures in the Maasai Steppe, Northern Tanzania(Zenodo, 2024-10-01) Mtei, Kelvin; Kalonga, Joseph; Kimaro, Anthony; Massawe, Boniface; Winowiecki, LeighThe data set describes the extent of land degradation and control measures of soil erosion in Maasai steppe of northern Tanzania. Data were collected between 2021 and 2022.Item Dataset: Data on Soil Characteristics and Fluoride Dynamics with Dried Seaweed Hydroxyapatite Treatment in Arusha, Tanzania(Zenodo, 2023-04-01) Mtei, KelvinThe data set describes soil properties including fluoride contents in the soil and performance of dried seaweed hydroxyappatite on reduction of water soluble Fluoride in soils. Data were collected in Arusha, Northern Tanzania in 2021. The dataset has been developed in the framework the H2020 EWA BELT Project (GA-862848) -Linking East and West African farming systems experience into a BELT of sustainable intensification- research activities.Item Dataset: Integrating stakeholders' perspectives and spatial modelling to develop scenarios of future land use and land cover change in northern Tanzania(Zenodo, 2021-01-13) Kariuki, Rebecca; Munishi, Linus; Courtney-Mustaphi, Colin; Capitani, Claudia; Shoemaker, Anna; Lane, Paul; Marchant, RobRapid rates of land use and land cover change (LULCC) in eastern Africa and limited instances of genuinely equal partnerships involving scientists, communities and decision makers challenge the development of robust pathways toward future environmental and socioeconomic sustainability. We use a participatory modelling tool, Kesho, to assess the biophysical, socioeconomic, cultural and governance factors that influenced past (1959-1999) and present (2000-2018) LULCC in northern Tanzania and to simulate four scenarios of land cover change to the year 2030. Simulations of the scenarios used spatial modelling to integrate stakeholders' perceptions of future environmental change with social and environmental data on recent trends in LULCC. From stakeholders' perspectives, between 1959 and 2018, LULCC was influenced by climate variability, availability of natural resources, agriculture expansion, urbanization, tourism growth, and legislation governing land access and natural resource management. Among other socio-environmental-political LULCC drivers, the stakeholders envisioned that from 2018 to 2030 LULCC will largely be influenced by land health, natural and economic capital, and political will in implementing land use plans and policies. The projected scenarios suggest that by 2030 agricultural land will have expanded by 8-20% under different scenarios and herbaceous vegetation and forest land cover will be reduced by 2.5-5% and 10-19% respectively. Stakeholder discussions further identified desirable futures in 2030 as those with improved infrastructure, restored degraded landscapes, effective wildlife conservation, and better farming techniques. The undesirable futures in 2030 were those characterized by land degradation, poverty, and cultural loss. Insights from our work identify the implications of future LULCC scenarios on wildlife and cultural conservation and in meeting the Sustainable Development Goals (SDGs) and targets by 2030. The Kesho approach capitalizes on knowledge exchanges among diverse stakeholders, and in the process promotes social learning, provides a sense of ownership of outputs generated, democratizes scientific understanding, and improves the quality and relevance of the outputs.Item Dataset: SNP genotyping of indigenous goats of Uganda based on the Goat_IGGC_65K_v2 illumina chip(2024-05-23) Nantongo, Ziwena; Birungi, Josephine; Obol Opiyo, Stephen; Shirima, Gabriel; Mugerwa, Swidiq; Mutai, Collins; Kyalo, Martina; Munishi, Linus; Agaba, Morris; Mrode, RaphaelUganda's indigenous goats are characterised based on ethnic communities that raise them, average mature weight, and hair coat characteristics. Uganda's indigenous goats have been genotyped based on the Goat_IGGC_65K_v2 illumina chip to study their population structure and genetic characteristics. Information generated from this data is vital for the sustainable utilisation, development, and conservation of Uganda's goat genetic resources.Item Dataset: Availability and functionality of neontal care units in health facilities(2022-07-21) Kamala, ServeusBackground Evidence shows that delivery of prompt and appropriate in-patient newborn care (IPNC) through health facility (HF)-based neonatal care and stabilization units (NCU/NSUs) reduce preventable newborn mortalities (NMs). This study investigated the HFs for the availability and performance of NCU/NSUs in providing quality IPNC, as well as an investigation of influencing factors in Mtwara region, Tanzania. Results About 70.6% (12/17) of surveyed HFs had at least one NCU/NSU room dedicated for delivery of IPNC, and 74.7% (3,600/4,819) of needy newborns were admitted/transferred in for management. Essential medicines such as tetracycline eye ointment was unavailable in 75% (3/4) of the district hospitals (DHs). A disparity existed between the availability and functioning of equipment including infant radiant warmers (92% vs 73%). Governance, support from implementing patterns (IPs), and access to healthcare commodities were identified from qualitative inquiries as factors influencing the establishment and running of NCUs/NSUs at the HFs in Mtwara region, Tanzania. Conclusion Despite the positive progress, the establishment and performance of NCUs/NSUs in providing quality IPNC in HFs in Mtwara has lagged behind the Tanzania neonatal care guideline standards, particularly after the IPs of newborn health interventions completed their terms in 2016. This study suggests additional improvement plans for Mtwara region and other comparable settings to optimize the provision of quality IPNC and lower avoidable NMs. NotesItem Dataset: Machine Learning Dataset for Poultry Diseases Diagnostics - PCR annotated(2023-12-23) Machuve, Dina; Nwankwo, Ezinne; Lyimo, Emmanuel; Maguo, Evarist; Munisi, CharlesThe dataset of poultry disease diagnostics was annotated using Polymerase Chain Reaction (PCR). Polymerase Chain Reaction (PCR) is a molecular biology technique for rapid diagnostics. We gathered both the fecal images and fecal samples from layers, cross and indigenous breeds of chicken from poultry farms in Arusha and Kilimanjaro regions in Tanzania between September 2020 and February 2021. Each fecal sample collected was coded to its corresponding image during data collection. PCR method is used for detection and identification of pathogens through amplification of DNA sequences unique to the pathogen. We used existing primers from literature to amplify the target DNA/RNA on the poultry fecal samples for PCR. The targets were Coccidiosis, Newcastle disease and Salmonella. We used the primers for PCR diagnostics at the molecular laboratory of the Nelson Mandela African Institution of Science and Technology (NM-AIST). The fecal samples were stored at -80 degrees celsius. The PCR diagnostics were conducted using reagents and kits from Zymo Research and the protocol is summarized in these five stages: 1. DNA sample loading 2. DNA extraction 3. Amplification; 4. Quantification and 5. Detection.Item Dataset: A Labeled Dataset of Healthy and Diseased Maize from Tanzania(Zenodo, 2025-07-04) Mduma, Neema; Laizer, Hudson; Kiriba, DeodatusThe maize images dataset was developed to support research in diagnosing major maize diseases and improving crop yields. Sponsored by GrowFurther and conducted by researchers at the Nelson Mandela African Institution of Science and Technology (NM-AIST) in collaboration with the Tanzania Agriculture Research Institute (TARI), the project produced a labeled dataset featuring healthy maize, Maize Lethal Necrosis and Maize Streak Virus. The dataset is designed for image classification and object detection tasks.Item Dataset: A Labeled Dataset of Healthy and Diseased Common Beans from Tanzania(Zenodo, 2025-07-01)The common bean images dataset was developed to advance research in diagnosing key bean diseases and boosting crop productivity. Sponsored by GrowFurther and carried out by researchers at the Nelson Mandela African Institution of Science and Technology (NM-AIST) in collaboration with the Tanzania Agriculture Research Institute (TARI), the project produced a labeled dataset featuring leaf images of healthy beans, Bean Anthracnose and Bean Rust. The dataset supports both image classification and object detection tasks.Item Dataset: Evolution of toll-like receptors in the context of terrestrial ungulates and cetaceans diversification(Zenodo, 2017-02-06) Ishengoma, Edson; Agaba, MorrisBackground: Toll-like receptors (TLRs) are the frontline actors in the innate immune response to various pathogens and are expected to be targets of natural selection in species adapted to habitats with contrasting pathogen burdens. The recent publication of genome sequences of giraffe and okapi together afforded the opportunity to examine the evolution of selected TLRs in broad range of terrestrial ungulates and cetaceans during their complex habitat diversification. Through direct sequence comparisons and standard evolutionary approaches, the extent of nucleotide and protein sequence diversity in seven Toll-like receptors (TLR2, TLR3, TLR4, TLR5, TLR7, TLR9 and TLR10) between giraffe and closely related species was determined. In addition, comparison of the patterning of key TLR motifs and domains between giraffe and related species was performed. The quantification of selection pressure and divergence on TLRs among terrestrial ungulates and cetaceans was also performed. Results: Sequence analysis shows that giraffe has 94–99% nucleotide identity with okapi and cattle for all TLRs analyzed. Variations in the number of Leucine-rich repeats were observed in some of TLRs between giraffe, okapi and cattle. Patterning of key TLR domains did not reveal any significant differences in the domain architecture among giraffe, okapi and cattle. Molecular evolutionary analysis for selection pressure identifies positive selection on key sites for all TLRs examined suggesting that pervasive evolutionary pressure has taken place during the evolution of terrestrial ungulates and cetaceans. Analysis of positively selected sites showed some site to be part of Leucine-rich motifs suggesting functional relevance in species-specific recognition of pathogen associated molecular patterns. Notably, clade analysis reveals significant selection divergence between terrestrial ungulates and cetaceans in viral sensing TLR3. Mapping of giraffe TLR3 key substitutions to the structure of the receptor indicates that at least one of giraffe altered sites coincides with TLR3 residue known to play a critical role in receptor signaling activity. Conclusion: There is overall structural conservation in TLRs among giraffe, okapi and cattle indicating that the mechanism for innate immune response utilizing TLR pathways may not have changed very much during the evolution of these species. However, a broader phylogenetic analysis revealed signatures of adaptive evolution among terrestrial ungulates and cetaceans, including the observed selection divergence in TLR3. This suggests that long term ecological dynamics has led to species-specific innovation and functional variation in the mechanisms mediating innate immunity in terrestrial ungulates and cetaceans.Item Dataset: Banana Imagery Dataset - Tanzania(Zenodo, 2023-02-23) Mduma, Neema; Elinisa, ChristianThe banana images dataset was created to contribute to the study of banana diseases diagnostics. The images target the diagnostics of Black Sigatoka and Fusarium Wilt Race 1 diseases. We are motivated in developing end to end tools to help farmers diagnose diseases and improve banana productivity. The dataset was created to facilitate image classification and object detection tasks.Item Dataset: Optimizing LoRaWAN Throughput in Maritime Environments Through Adaptive Coding and Modulation in Rayleigh Fading Channels(Zenodo, 2025) Lyimo, Martine; Mgawe, Bonny; Leo, Judith; Dida, Mussa; Michael, KisangiriThis dataset supports the article “Optimizing LoRaWAN Throughput in Maritime Environments through Adaptive Coding and Modulation under Rayleigh Fading”. It includes simulation outputs and MATLAB source code for reproducing all figures and results in the study. Files include throughput, PER, energy efficiency, spectral efficiency, and an ACM algorithm function.