# Selecting Genes from a DEG List

Aarthi Ramakrishnan

Let's consider the scenario where we would like to select genes from a differential expression analysis list that show at least a 30% increase or a 30% decrease in expression. This can be confusing at first if you attempt it based on the log fold change values, because a 30% increase and a 30% decrease do not correspond to the same absolute log2FoldChange. So let's look into this further.

Case 1 - 30% Increase:

Let's assume the expression of gene X is 10 in the control condition and 13 in the treatment condition. Gene X is up-regulated, with a 30% increase in expression: (13 - 10)/10 = 0.3. The foldChange is 13/10 = 1.3, and the log2FoldChange is log2(13/10) ≈ 0.378. Therefore, one may use a log2FoldChange threshold of 0.378 and above to obtain genes that are up-regulated by at least 30%.

Case 2 - 30% Decrease:

Now assume the expression of gene X is 10 in the control condition and 7 in the treatment condition. Gene X is down-regulated, with a 30% decrease in expression: (7 - 10)/10 = -0.3. The foldChange is 7/10 = 0.7, and the log2FoldChange is log2(7/10) ≈ -0.514. Therefore, one may use a log2FoldChange threshold of -0.514 and below to obtain genes that are down-regulated by at least 30%.
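The two cases above can be sketched in a few lines of Python. This is only an illustration of the arithmetic, not part of any DESeq2 workflow; the helper name `log2fc_threshold` is made up for this example.

```python
import math

def log2fc_threshold(percent_change):
    """Hypothetical helper: log2 fold change for a signed percent change.

    +30 means a 30% increase, -30 means a 30% decrease.
    """
    fold_change = 1 + percent_change / 100.0
    if fold_change <= 0:
        raise ValueError("percent_change must be greater than -100")
    return math.log2(fold_change)

up_threshold = log2fc_threshold(30)     # log2(1.3), roughly 0.3785
down_threshold = log2fc_threshold(-30)  # log2(0.7), roughly -0.5146

print(f"up:   {up_threshold:.4f}")
print(f"down: {down_threshold:.4f}")
```

Note that the two thresholds differ in absolute value, so a single symmetric cutoff such as |log2FoldChange| >= 0.378 would not select the same percentage change in both directions.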

Convert log2FoldChange to foldChange:

foldChange = 2^log2FoldChange. For example: 2^(-0.514) ≈ 0.7
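As a rough sketch of how this conversion and the thresholds might be applied, the Python snippet below filters a small, made-up list of genes. The gene names and log2FoldChange values are hypothetical, and the function names are illustrative only.

```python
import math

def fold_change(log2fc):
    """Convert a log2 fold change back to a plain fold change."""
    return 2 ** log2fc

def percent_change(log2fc):
    """Signed percent change implied by a log2 fold change."""
    return (fold_change(log2fc) - 1) * 100

# Hypothetical (gene, log2FoldChange) pairs, for illustration only.
degs = [("geneA", 0.45), ("geneB", 0.20), ("geneC", -0.60), ("geneD", -0.40)]

UP = math.log2(1.3)    # at least a 30% increase
DOWN = math.log2(0.7)  # at least a 30% decrease

up_genes = [gene for gene, lfc in degs if lfc >= UP]
down_genes = [gene for gene, lfc in degs if lfc <= DOWN]

print(up_genes)    # ['geneA']
print(down_genes)  # ['geneC']
```

Here geneD has a log2FoldChange of -0.40, which is larger in magnitude than the up-regulation threshold but corresponds to a decrease of only about 24%, so it is not selected.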
