This publication is intended to shed light on the situation that “some” institutions have begun to understand and value their data as an internal product that will boost, qualify and distinguish their products concerning the market in the short term. This same company that, unfortunately, internally, each business area built its architecture, many times without a standard, without quality or basic sanitation and making the distribution in an archaic and slow way, full of “gambiaras” a veritable data slum.
In this chaotic situation, it is very common for business people to be somehow alienated, blaming situations of slowness in a…
Essa publicação se destina a dar uma luz para a situação que “algumas” instituições que começaram a entender e valorizar seus dados como sendo um produto interno que impulsiona, qualifica e distingui seus produtos em relação ao mercado a curto prazo. Essas mesmas empresas que infelizmente, internamente, cada área de negócio construiu sua arquitetura muitas das vezes sem um padrão, sem uma qualidade ou saneamento básico e fazendo a distribuição de maneira arcaica e lenta, cheias de gambiaras, uma verdadeira favela de dados.
Nessa situação caótica é muito comum que as pessoas do negócio estejam de certa forma alienadas, colocando…
Jupyter emerged and gained respect for being an easy-to-install solution, in addition to bringing the proposed use to facilitate coding and visualization of the code, a form of interactive computing aimed at usability. With the success of Python, other market tools started to support this language and compete directly with Jupyter.
#Now that I spoke well about jupyter, let’s go to the competition… :)
Cassandra is a NoSQL database developed to ensure rapid scalability and high availability of data, being open source and maintained mainly by the Apache Foundation and its community.
Its main features are:
According to Apache, we can look at Cassandra as being:
O Cassandra é um bancos de dados NoSQL, desenvolvido para garantir rápida escalabilidade e alta disponibilidade dos dados, sendo de código aberto e mantido principalmente pela fundação Apache e sua comunidade.
Suas principais caraterísticas são:
Segundo a Apache podemos olhar para o Cassandra como sendo:
“O banco de dados Apache Cassandra é…
Docker and containers
Each time we need to create segregated and resilient environments capable of supporting from small applications to distributed databases, making intelligent governance of the computational resources of infrastructure and network simultaneously that we scale solutions horizontally, depending on the need of the scenario covered.
As a solution to this dilemma, containers appeared, according to Microsoft, they are “similar to virtual machines, but they do not create an entire virtual operating system. Instead, Docker allows the application to use the same Linux kernel as the system on which it is running. ”
Docker is one of the main…
We are adapting to the General Data Protection Law (LGPD); this new law has as main objective to guarantee data privacy and reliability, but how is the data area adapted to this new reality? What are the strategies adopted? What about Artificial Intelligence (AI)?
These are some of the strategies that are happening in the data area:
More and more, we have to deal with large volumes of data that are created and need to be used at an unbelievable Speed, having a huge variation, almost impossible for a human being to follow, to be concerned with its Veracity, and to be able to add value to the business in a way effective. (The 5 V of Big data).
To deal with this, the term “Big data” came up, and several solutions to deal with these problems in different scenarios, such as Apache Hive.
According to IBM, “Apache Hive is open source data warehouse…
This is a simple tutorial with examples of using Google Cloud to run Spark jobs done in Scala easily! :)
To check if Java is installed on your operating system, use the command below:
#Depending on the version of Java, this command can change … :)
If Java has not been installed yet, install from these links: Oracle Java 8, Oracle Java 11, or AdoptOpenJDK 8/11; always checking the compatibility between the versions of JDK and Scala following the guidelines of this link: JDK Compatibility.
2. Install Scala
I am sharing my opinion and what little I know of eventually here.