Best AI tools for Data Manipulation
20 - AI tool Sites

Dot Analytics
Dot Analytics is a growth-focused data analytics agency that offers a wide range of services including data analytics, data engineering, data visualization, data science, big data analytics, AI consulting, and more. They specialize in providing analytics solutions for data-driven business managers seeking accuracy, statistics, and data to drive revenue growth. With over 6 years of experience, they offer tailored analytics solutions to optimize customer acquisition cost, lifetime value, average order value, and conversions. Dot Analytics partners with clients from various industries to provide transparent maintenance and optimization services.

Onvo AI
Onvo AI is an embedded data visualization platform that allows users to create interactive dashboards and data visualizations using simple AI prompts. It connects to various data sources, enabling users to build insightful charts and visualizations without the need for complex queries or manual data manipulation. Onvo AI offers customization options, allowing users to match their brand's identity and provides secure sharing and embedding features. It also includes analytics capabilities, enabling users to track dashboard usage and user engagement.

InstantAPI.ai
InstantAPI.ai is an AI-powered web scraping tool that allows developers, data scientists, and SEO specialists to instantly turn any web page into a personalized API. With the ability to effortlessly scrape, customize, and integrate data, users can enhance their projects, drive insights, and optimize performance. The tool offers features such as scraping precise data, transforming information into various formats, generating new content, providing advanced analysis, and extracting valuable insights from data. Users can tailor the output to meet specific needs and unleash creativity by using AI for unique purposes. InstantAPI.ai simplifies the process of web scraping and data manipulation, offering a seamless experience for users seeking to leverage AI technology for their projects.

Floneum
Floneum is a versatile AI-powered tool designed for language tasks. It offers a user-friendly interface to build workflows using large language models. With Floneum, users can securely extend functionality by writing plugins in various languages compiled to WebAssembly. The tool provides a sandboxed environment for plugins, ensuring limited resource access. With 41 built-in plugins, Floneum simplifies tasks such as text generation, search engine operations, file handling, Python execution, browser automation, and data manipulation.

Realflow.ai
Realflow.ai is an AI-powered platform that offers GPT & AI capabilities for Citizen Developers. It provides a unique Visual Data Pathways Builder, 185 File, Database and SaaS Connectors, and Excel Formula Transforms for data manipulation. The platform simplifies data integration and transformation processes by eliminating the need for scripting languages and SQL queries. Realflow.ai aims to empower users with no-code solutions to build integrations and streamline data workflows across various platforms.

Formularizer
Formularizer is an AI-powered assistant designed to help users with formula-related tasks in spreadsheets like Excel, Google Sheets, and Notion. It provides step-by-step guidance, formula generation, and explanations to simplify complex formula creation and problem-solving. With support for regular expressions, Excel VBA, and Google Apps Script, Formularizer aims to enhance productivity and make data manipulation more accessible.

Tomat.AI
Tomat.AI is an AI-powered tool designed to help users open and explore large CSV files effortlessly. With features like automated data profiling, merging multiple files, and building reports, Tomat.AI simplifies the process of analyzing and automating Excel and CSV files without the need for coding skills. The tool ensures data security by operating entirely on the user's local machine, offering a user-friendly interface for seamless data manipulation and analysis.

LMarena.ai
LMarena.ai is an AI-powered security service that protects websites from online attacks by enabling cookies and blocking malicious activities. It uses advanced algorithms to detect and prevent security threats, ensuring a safe browsing experience for users. The service is designed to safeguard websites from various types of attacks, such as SQL injections and data manipulation. LMarena.ai offers a reliable and efficient security solution for website owners to maintain the integrity and performance of their online platforms.

generatejson.com
The website generatejson.com appears to be inaccessible due to an 'Access Denied' error. It seems that users are encountering permission issues when trying to access the site. The error message references a server issue and provides a specific reference number. The website may be related to generating JSON data, but further details are not available from the provided text.

PPnude
PPnude is an AI tool that offers various features such as Deepnude, Deepfake, Face Swap, and AI Anime Generator. Users can easily create nude images, swap faces in photos and videos, generate AI anime art, and more. The tool provides creative freedom for users to explore different possibilities in image manipulation and content creation. PPnude ensures user privacy and security by not storing any data or photos. It offers a user-friendly experience with no installation required, making it a convenient choice for those interested in AI-generated adult content.

Lilac
Lilac is an AI tool designed to enhance data quality and exploration for AI applications. It offers features such as data search, quantification, editing, clustering, semantic search, field comparison, and fuzzy-concept search. Lilac enables users to accelerate dataset computations and transformations, making it a valuable asset for data scientists and AI practitioners. The tool is trusted by Alignment Lab and is recommended for working with LLM datasets.

Dobb·E
Dobb·E is an open-source, general framework for learning household robotic manipulation. It aims to create a 'generalist machine' for homes, a domestic assistant that can adapt and learn various tasks cost-effectively. Dobb·E can learn a new task with just five minutes of demonstration, achieving an 81% success rate in 10 NYC homes. The system is designed to accelerate research on home robots and eventually enable robot butlers in every home.

Toggle Terminal
Toggle Terminal is an AI-powered platform that brings data to life with natural language. It offers a suite of award-winning analytic tools wrapped in an accessible, natural language-based user experience. Users can ask questions in plain language and receive immediate, data-backed answers without the need for coding or spreadsheet manipulation. Toggle Terminal provides institutional-grade analytical tools for scenario testing, asset intelligence, chart exploration, and idea discovery. It helps users connect data, test market hypotheses, screen securities, and explore hidden relationships between organizations. Additionally, Toggle AI offers customized AI solutions and integrations for institutional investors in asset management and capital markets.

Facia.ai
Facia.ai is a cutting-edge AI application that specializes in fast and accurate face recognition with 3D liveness detection. It offers solutions for businesses and governments to prevent identity fraud, deepfake manipulation, and enhance security through facial biometric analysis. The platform provides advanced features such as face matching, ID document verification, deepfake detection, age estimation, and iris recognition. Facia.ai stands out for its industry-leading accuracy, customizable integration, and user-driven design philosophy, ensuring a reliable and secure experience for users.

Music Business Worldwide
Music Business Worldwide is a platform that provides news, interviews, analysis, and job opportunities for the global music industry. It covers a wide range of topics such as artist management, music production, songwriting, industry insights, and financial reports. The platform aims to keep music professionals informed about the latest trends and developments in the music business.

Face Swap
Face-swap.app is a free AI-powered tool that allows users to easily swap faces in photos and videos. With a user-friendly interface and lightning-fast processing speed, the tool seamlessly blends target faces into base photos to create natural and realistic results. The application prioritizes privacy and security, ensuring that facial data is never stored. Users can enjoy unlimited fun and peace of mind while using this complimentary service.

Towards Data Science
Towards Data Science is a Medium publication dedicated to sharing concepts, ideas, and code in the field of data science. It provides a platform for data scientists, researchers, and practitioners to connect, learn, and contribute to the advancement of the field.

What's The Big Data
What's The Big Data is an AI tool directory that helps users unleash their potential by providing a comprehensive source for AI tools, data, and ChatGPT. The platform is updated daily and caters to every need, offering a wide range of AI assistants across various categories. Users can easily find their perfect AI assistant with just a click, making it a valuable resource for those seeking AI solutions.

Data Science Dojo
Data Science Dojo is a globally recognized e-learning platform that offers programs in data science, data analytics, machine learning, and more. They provide comprehensive and hands-on training in various formats such as in-person, virtual instructor-led, and self-paced training. The focus is on helping students develop a think-business-first mindset to apply their data science skills effectively in real-world scenarios. With over 2500 enterprises trained, Data Science Dojo aims to make data science accessible to everyone.

Domino Data Lab
Domino Data Lab is an enterprise AI platform that enables data scientists and IT leaders to build, deploy, and manage AI models at scale. It provides a unified platform for accessing data, tools, compute, models, and projects across any environment. Domino also fosters collaboration, establishes best practices, and tracks models in production to accelerate and scale AI while ensuring governance and reducing costs.

20 - Open Source AI Tools

TableLLM
TableLLM is a large language model designed for efficient tabular data manipulation tasks in real office scenarios. It can generate code solutions or direct text answers for tasks like insert, delete, update, query, merge, and chart operations on tables embedded in spreadsheets or documents. The model has been fine-tuned from CodeLlama-7B and 13B and comes in two sizes: TableLLM-7B and TableLLM-13B. Evaluation results report its performance on benchmarks such as WikiSQL, Spider, and a self-created table operation benchmark. Users can use TableLLM for code and text generation tasks on tabular data.

driverlessai-recipes
This repository contains custom recipes for H2O Driverless AI, which is an Automatic Machine Learning platform for the Enterprise. Custom recipes are Python code snippets that can be uploaded into Driverless AI at runtime to automate feature engineering, model building, visualization, and interpretability. Users can gain control over the optimization choices made by Driverless AI by providing their own custom recipes. The repository includes recipes for various tasks such as data manipulation, data preprocessing, feature selection, data augmentation, model building, scoring, and more. Best practices for creating and using recipes are also provided, including security considerations, performance tips, and safety measures.

pandas-ai
PandaAI is a Python platform that enables users to interact with their data in natural language, catering to both non-technical and technical users. It simplifies data querying and analysis, offering conversational data analytics capabilities with minimal code. Users can ask questions, visualize charts, and compare dataframes effortlessly. The tool aims to streamline data exploration and decision-making processes by providing a user-friendly interface for data manipulation and analysis.

sql-eval
This repository contains the code that Defog uses to evaluate generated SQL. It is based on the schema from Spider, but with a new set of hand-selected questions and queries grouped by query category. The testing procedure involves generating a SQL query, running both the 'gold' query and the generated query on their respective databases to obtain dataframes with the results, comparing the dataframes using an 'exact' and a 'subset' match, logging these alongside other metrics of interest, and aggregating the results for reporting. The repository provides comprehensive instructions for installing dependencies, starting a Postgres instance, importing data into Postgres, importing data into Snowflake, using private data, implementing a query generator, and running the test with different runners.
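The 'exact' and 'subset' comparison described above can be illustrated with a minimal sketch. This is not Defog's implementation: the function names `exact_match` and `subset_match` are hypothetical, query results are modelled as plain lists of row dictionaries rather than dataframes, and the sketch assumes that column names in the gold and generated results line up.

```python
from collections import Counter


def exact_match(gold, generated):
    """True when both results contain the same rows, ignoring row order.

    Each result is a list of dicts mapping column name -> value
    (an assumed stand-in for a dataframe).
    """
    def as_multiset(rows):
        # Sort items by column name so column order does not matter.
        return Counter(tuple(sorted(row.items())) for row in rows)

    return as_multiset(gold) == as_multiset(generated)


def subset_match(gold, generated):
    """True when the generated result, restricted to the gold columns,
    matches the gold result. Extra generated columns are tolerated."""
    if not gold:
        return not generated
    gold_columns = list(gold[0].keys())
    # Every generated row must carry all gold columns.
    if any(col not in row for row in generated for col in gold_columns):
        return False
    projected = [{col: row[col] for col in gold_columns} for row in generated]
    return exact_match(gold, projected)


gold = [{"name": "a", "total": 1}, {"name": "b", "total": 2}]
generated = [
    {"total": 2, "name": "b", "extra": 9},
    {"total": 1, "name": "a", "extra": 8},
]
print(exact_match(gold, generated))   # extra column breaks the exact match
print(subset_match(gold, generated))  # gold columns are all present and equal
```

Here the generated query returns an additional column, so it fails the exact comparison but passes the subset comparison.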

sec-parser
The `sec-parser` project simplifies extracting meaningful information from SEC EDGAR HTML documents by organizing them into semantic elements and a tree structure. It helps in parsing SEC filings for financial and regulatory analysis, analytics and data science, AI and machine learning, causal AI, and large language models. The tool is especially beneficial for AI, ML, and LLM applications by streamlining data pre-processing and feature extraction.

Awesome-Tabular-LLMs
This repository is a collection of papers on Tabular Large Language Models (LLMs) specialized for processing tabular data. It includes surveys, models, and applications related to table understanding tasks such as Table Question Answering, Table-to-Text, Text-to-SQL, and more. The repository categorizes the papers based on key ideas and provides insights into the advancements in using LLMs for processing diverse tables and fulfilling various tabular tasks based on natural language instructions.

lhotse
Lhotse is a Python library designed to make speech and audio data preparation flexible and accessible. It aims to attract a wider community to speech processing tasks by providing a Python-centric design and an expressive command-line interface. Lhotse offers standard data preparation recipes, PyTorch Dataset classes for speech tasks, and efficient data preparation for model training with audio cuts. It supports data augmentation, feature extraction, and feature-space cut mixing. The tool extends Kaldi's data preparation recipes with seamless PyTorch integration, human-readable text manifests, and convenient Python classes.

aps-toolkit
APS Toolkit is a powerful tool for developers, software engineers, and AI engineers to explore Autodesk Platform Services (APS). It allows users to read, download, and write data from APS, as well as export data to various formats like CSV, Excel, JSON, and XML. The toolkit is built on top of Autodesk.Forge and Newtonsoft.Json, offering features such as reading SVF models, querying properties database, exporting data, and more.

NBA-Machine-Learning-Sports-Betting
This tool is a machine learning AI used to predict the winners and under/overs of NBA games. It takes all team data from the 2007-08 season to the current season, matched with odds of those games, and uses a neural network to predict winning bets for today's games. The tool achieves ~69% accuracy on money lines and ~55% on under/overs. It outputs expected value for teams' money lines to provide better insight and the fraction of your bankroll to bet based on the Kelly Criterion. A popular, less risky approach is to bet 50% of the stake recommended by the Kelly Criterion.
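The Kelly Criterion and the half-stake variant mentioned above can be sketched as follows. This uses the standard textbook Kelly formula, f = (b·p − q) / b, where b is the net odds, p the win probability, and q = 1 − p; whether the repository computes it exactly this way is an assumption, and the function names are hypothetical.

```python
def kelly_fraction(win_prob: float, decimal_odds: float) -> float:
    """Fraction of bankroll to stake under the standard Kelly formula.

    decimal_odds is the total payout per unit staked (e.g. 2.0 for even money).
    Returns 0.0 when the bet has no positive edge.
    """
    net_odds = decimal_odds - 1.0  # profit per unit staked on a win
    if net_odds <= 0:
        raise ValueError("decimal_odds must be greater than 1")
    lose_prob = 1.0 - win_prob
    fraction = (net_odds * win_prob - lose_prob) / net_odds
    return max(fraction, 0.0)


def expected_value(win_prob: float, decimal_odds: float, stake: float = 1.0) -> float:
    """Expected profit of a single bet of the given stake."""
    return stake * (win_prob * (decimal_odds - 1.0) - (1.0 - win_prob))


full_kelly = kelly_fraction(win_prob=0.6, decimal_odds=2.0)
half_kelly = 0.5 * full_kelly  # the less risky 50% variant
print(f"full Kelly: {full_kelly:.2f}, half Kelly: {half_kelly:.2f}")
print(f"expected value per unit staked: {expected_value(0.6, 2.0):.2f}")
```

With a 60% win probability at even-money odds, the formula suggests staking about 20% of the bankroll, and the half-Kelly approach about 10%.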

100days_AI
The 100 Days in AI repository provides a comprehensive roadmap for individuals to learn Artificial Intelligence over a period of 100 days. It covers topics ranging from basic programming in Python to advanced concepts in AI, including machine learning, deep learning, and specialized AI topics. The repository includes daily tasks, resources, and exercises to ensure a structured learning experience. By following this roadmap, users can gain a solid understanding of AI and be prepared to work on real-world AI projects.

llm-course
The LLM course is divided into three parts: (1) LLM Fundamentals covers essential knowledge about mathematics, Python, and neural networks; (2) The LLM Scientist focuses on building the best possible LLMs using the latest techniques; (3) The LLM Engineer focuses on creating LLM-based applications and deploying them. For an interactive version of the course, the author created two LLM assistants that answer questions and test your knowledge in a personalized way: a HuggingChat Assistant (free version using Mixtral-8x7B) and a ChatGPT Assistant (requires a premium account). The repository also lists notebooks and articles related to large language models, including these tools: LLM AutoEval (automatically evaluate your LLMs using RunPod), LazyMergekit (easily merge models using MergeKit in one click), LazyAxolotl (fine-tune models in the cloud using Axolotl in one click), AutoQuant (quantize LLMs in GGUF, GPTQ, EXL2, AWQ, and HQQ formats in one click), Model Family Tree (visualize the family tree of merged models), and ZeroSpace (automatically create a Gradio chat interface using a free ZeroGPU).

create-million-parameter-llm-from-scratch
The 'create-million-parameter-llm-from-scratch' repository provides a detailed guide on creating a Large Language Model (LLM) with 2.3 million parameters from scratch. The blog replicates the LLaMA approach, incorporating concepts like RMSNorm for pre-normalization, SwiGLU activation function, and Rotary Embeddings. The model is trained on a basic dataset to demonstrate the ease of creating a million-parameter LLM without the need for a high-end GPU.

awesome-ai-ml-resources
This repository is a collection of free resources and a roadmap designed to help individuals learn Machine Learning and Artificial Intelligence concepts by providing key concepts, building blocks, roles, a learning roadmap, courses, certifications, books, tools & frameworks, research blogs, applied blogs, practice problems, communities, YouTube channels, newsletters, and must-read papers. It covers a wide range of topics from supervised learning to MLOps, offering guidance on learning paths, practical experience, and job interview preparation.

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse is a comprehensive and curated guide designed to empower developers, researchers, and enthusiasts to harness the true capabilities of Large Language Models (LLMs) and build intelligent applications that push the boundaries of natural language understanding. This GitHub repository provides in-depth articles, codebase mastery, LLM PlayLab, and resources for cost analysis and network visualization. It covers various aspects of LLMs, including NLP, models, training, evaluation metrics, open LLMs, and more. The repository also includes a collection of code examples and tutorials to help users build and deploy LLM-based applications.

GenAI_Agents
GenAI Agents is a comprehensive repository for developing and implementing Generative AI (GenAI) agents, ranging from simple conversational bots to complex multi-agent systems. It serves as a valuable resource for learning, building, and sharing GenAI agents, offering tutorials, implementations, and a platform for showcasing innovative agent creations. The repository covers a wide range of agent architectures and applications, providing step-by-step tutorials, ready-to-use implementations, and regular updates on advancements in GenAI technology.

DecryptPrompt
This repository does not provide a tool, but rather a collection of resources and strategies for academics in the field of artificial intelligence who are feeling depressed or overwhelmed by the rapid advancements in the field. The resources include articles, blog posts, and other materials that offer advice on how to cope with the challenges of working in a fast-paced and competitive environment.

20 - OpenAI GPTs

MeepMouse
MeepMouse, the advanced computer mouse for developers, displays logs of edits made in a virtual IDE, simulating direct code manipulation.

Pymage
Python engineer for creating and manipulating images and files. Easy, clear, and in Catalan.

Your Business Data Optimizer Pro
A chatbot expert in business data analysis and optimization.

Data Dynamo
A friendly data science coach offering practical, useful, and accurate advice.

DataKitchen DataOps and Data Observability GPT
A specialist in DataOps and Data Observability, aiding in data management and monitoring.

Alas Data Analytics Student Mentor
Hello, I am Alas Academy's AI mentor for Data Analytics. You can ask me any question :)

CannaIndustry Data Expert
Data trend analysis expert in cannabis, also skilled in image and data analysis, document generation, and web search.