Hi.
So I installed and got qpAdm running without having too many problems but merging my files with the AADR dataset is proving to be impossible. First I tried merging the files directly in the EIGENSTRAT format(with -mergeit) but didnt get very far, then I read here about converting both the AADR dataset and your personal files in PLINK format to merge them and I followed the
@Jovialis tutorial (post #201), which did work but I encountered another problem when converting the merged files back to EIGENSTRAT (as far as I know qpAdm only works with this format except if your are using AT2), it leaves out me out!
This is the output:
View attachment 16915
Any idea why this happens? maybe my original RAW DNA file has poor coverage(MyHeritage, 600k SNPs)
View attachment 16916
^here is the problem I believe
I ran it through Chatgpt, see if it provides you any help:
The warnings in your PLINK output suggest two specific issues:
### 1. **Het. Haploid Genotypes**
- **Warning:** "10996 het. haploid genotypes present."
- **Explanation:** This means that certain genotypes on sex chromosomes (X/Y) are flagged as heterozygous in individuals who should have only one allele (males on the X or Y chromosome, or anyone on the mitochondrial DNA). PLINK treats these as errors since haploid regions should only have one allele.
- **Solution:**
- Ensure that the sex of individuals is correctly specified in your `.fam` file.
- You can exclude problematic variants or correct these genotypes. Use the following command:
```bash
plink --bfile [your_file] --set-hh-missing --make-bed --out corrected_data
```
This will set heterozygous haploid genotypes to missing.
### 2. **Nonmissing Nonmale Y Chromosome Genotypes**
- **Warning:** "Nonmissing nonmale Y chromosome genotype(s) present."
- **Explanation:** Genotypes for the Y chromosome are present in individuals who are not male (as per their sex information in the `.fam` file). PLINK expects only males to have Y chromosome data.
- **Solution:**
- Check for errors in the `.fam` file’s sex designation. Ensure that:
- `1` = male
- `2` = female
- `0` = unknown
- If there are data inconsistencies, you can filter or correct them:
```bash
plink --bfile [your_file] --filter-males --make-bed --out males_only
```
### 3. **Phenotype Warnings**
- "16398 are cases and 0 are controls. (1 phenotype is missing.)"
- **Explanation:** This means the `.fam` file contains only case phenotypes (coded as `2`) and no controls (`1`), which is likely incorrect unless you are analyzing a dataset with no controls.
- **Solution:** If this is unintentional, check the phenotype column in your `.fam` file. You might need to correct missing or mislabeled phenotypes.
Would you like assistance fixing the input files or running specific commands?