|Home | About | Journals | Submit | Contact Us | Français|
SF1 and SF2 helicases are important molecular motors that use the energy of ATP to unwind nucleic acids or nucleic-acid protein complexes. They are ubiquitous enzymes and found in almost all organisms sequenced to date. This article provides a comparative analysis for SF1 and SF2 helicase families from three domains of life archaea, human, bacteria. Seven families are conserved in these three representatives and includes Upf1-like, UvrD-like, Rad3-like, DEAD-box, RecQ-like. Snf2 and Ski2-like. The data highlight conservation of the helicase core motifs for each of these families. Phylogenetic analysis presented on certain protein families are essential for further studies tracing the evolutionary history of helicase families. The data supplied in this article support publication “Genome-wide identification of SF1 and SF2 helicases from archaea” (Chamieh et al., 2016) .
Value of the data
Four figure files are presented. Fig. 1 denotes a comparative analysis of helicase core motifs in conserved families from archaea, bacteria and human. Fig. 2, Fig. 3, Fig. 4 are phylogenetic trees obtained after Maximum Likelihood analysis for Upf1-like and Rad3-like families, and Bayesian analysis for ski2-like helicase family.
All protein sequences were retrieved from existing protein databases and were used with their UniProt accession numbers and were classified into different families as shown in Chamieh et al. , . Multiple protein sequence alignment was performed using T-COFFEE EXPRESSO program for small sequence numbers (<150 sequences)  or PromalS3D for large sequence numbers (>150 sequences) . Fig. 1 was obtained from the multiple sequence alignment files for protein sequences within the same family using the WEBLOGO software . Sequences were inspected for their correct alignment within the helicase core domain. Multiple sequence alignment was trimmed using TrimAl v1.3 method set to automated . The best evolutionary fit model was identified using ProtTest . Phylogenetic analysis was performed using Maximum Likelihood analysis from MEGA7 software  or MrBayes with the TOPALI platform , .
Transparency documentTransparency data associated with this article can be found in the online version at doi:10.1016/j.dib.2017.02.047.